MST124 
Essential mathematics 1 


Book C 


The Open 
University 


This publication forms part of an Open University module. Details of this and other 

Open University modules can be obtained from the Student Registration and Enquiry Service, The 
Open University, PO Box 197, Milton Keynes MK7 6BJ, United Kingdom (tel. +44 (0)845 300 6090; 
email general-enquiries@open.ac.uk). 


Alternatively, you may visit the Open University website at www.open.ac.uk where you can learn 
more about the wide range of modules and packs offered at all levels by The Open University. 


To purchase a selection of Open University materials visit www.ouw.co.uk, or contact Open 
University Worldwide, Walton Hall, Milton Keynes MK7 6AA, United Kingdom for a brochure 
(tel. +44 (0)1908 858779; fax +44 (0)1908 858787; email ouw-customer-services@open.ac.uk). 


The Open University, Walton Hall, Milton Keynes, MK7 6AA. 
First published 2014. 
Copyright © 2014 The Open University 


All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, transmitted 
or utilised in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without 
written permission from the publisher or a licence from the Copyright Licensing Agency Ltd. Details of such 
licences (for reprographic reproduction) may be obtained from the Copyright Licensing Agency Ltd, Saffron 
House, 6-10 Kirby Street, London EC1N 8TS (website www.cla.co.uk). 


Open University materials may also be made available in electronic formats for use by students of the 
University. All rights, including copyright and related rights and database rights, in electronic materials and 
their contents are owned by or licensed to The Open University, or otherwise used by The Open University as 
permitted by applicable law. 


In using electronic materials and their contents you agree that your use will be solely for the purposes of 
following an Open University course of study or otherwise as licensed by The Open University or its assigns. 


Except as permitted above you undertake not to copy, store in any medium (including electronic storage or 
use in a website), distribute, transmit or retransmit, broadcast, modify or show in public such electronic 
materials in whole or in part without the prior written consent of The Open University or in accordance with 
the Copyright, Designs and Patents Act 1988. 


Edited, designed and typeset by The Open University, using the Open University TREX System. 
Printed in the United Kingdom by Halstan & Co. Ltd., Amersham, Bucks. 


ISBN 978 1 7800 7359 0 
1.1 


Contents 


Contents 
Unit 7 Differentiation methods and integration 1 
Introduction 3 
1 Derivatives of further standard functions 3 
1.1 Derivatives of sin, cos and tan 4 
1.2 Derivatives of exp and In 10 
2 Finding derivatives of more complicated functions 14 
2.1 Product rule 14 
2.2 Quotient rule 19 
2.3 Chain rule 23 
2.4 Differentiating functions of linear expressions 31 
2.5 Using the differentiation rules together 33 
3 More differentiation 38 
3.1 Optimisation problems 38 
3.2 Differentiation using a computer 44 
3.3 Derivatives of inverse functions 46 
3.4 Table of standard derivatives 54 
4 Reversing differentiation 55 
4.1 Antiderivatives and indefinite integrals 56 
4.2 Antiderivatives of power functions 61 
4.3 Constant multiple rule and sum rule for 
antiderivatives 64 
5 Particular antiderivatives 67 
5.1 Finding particular antiderivatives 67 
5.2 Using integration to work with rates of change 68 
5.3 Changes in the values of antiderivatives 74 
6 Antiderivatives of further standard functions 78 
6.1 An antiderivative of the reciprocal function 78 
6.2 Table of standard indefinite integrals 81 
Learning outcomes 83 
Solutions to activities 84 


Acknowledgements 101 


Contents 


Unit 8 Integration methods 
Introduction 


1 Areas, signed areas and definite integrals 
1.1 Areas and signed areas 
1.2 Definite integrals 


2 The fundamental theorem of calculus 
2.1 What is the fundamental theorem of calculus? 
2.2 Using the fundamental theorem to find signed areas 
2.3 Using the fundamental theorem to find changes in the 
values of antiderivatives 
2.4 Notation for indefinite integrals 


3 Integration by substitution 
3.1 Basic integration by substitution 
3.2 Integration by substitution in practice 
3.3 Integrating functions of linear expressions 
3.4 Integration by substitution for definite integrals 
3.5 Finding more complicated integrals by substitution 


4 Integration by parts 
4.1 Basic integration by parts 
4.2 Integration by parts in practice 
4.3 Integration by parts for definite integrals 


5 More integration 
5.1 Trigonometric integrals 
5.2 Choosing a method of integration 
5.3 Integration using a computer 


Learning outcomes 
Solutions to activities 


Acknowledgements 


103 
105 


106 
106 
119 


126 
127 
133 


146 
151 


155 
156 
162 
167 
172 
174 


176 
176 
180 
185 


189 
189 
192 
194 


196 
197 
216 


Unit 9 Matrices 


Introduction 


1 Matrices and matrix operations 


1:1 
12 
1.3 
1.4 


Matrix notation and terminology 
Matrix addition and subtraction 
Scalar multiplication of matrices 
Matrix multiplication 


2 Matrices and networks 


2l 
2.2 
2.3 


Networks 
From networks to matrices 
Combining networks 


3 The inverse of a matrix 


3.1 
3.2 


Identity matrices 
Matrix inverses 


4 Simultaneous linear equations and matrices 


4.1 


4.2 
4.3 


Solving simultaneous linear equations in two 
unknowns 

Non-invertible coefficient matrices 
Simultaneous linear equations in more than two 
unknowns 


Learning outcomes 


Solutions to activities 


Acknowledgements 


Index 


217 
219 


221 
221 
223 
226 
228 


238 
239 
242 
245 


251 
251 
253 


257 


258 
262 


265 
269 
270 
278 


279 


Contents 


Unit 7 


Differentiation methods and integration 


Introduction 


Introduction 


The first half of this unit continues with the topic of Unit 6, differential 
calculus. You'll meet the formulas for the derivatives of five standard 
functions, namely sin, cos, tan, exp and In. You’ll also learn some further 
ways of differentiating complicated functions by combining the formulas 
for the derivatives of standard functions. 


By the end of this part of the unit you should have acquired skills in 
differentiating a wide variety of functions. You’ll need these skills in the 
later parts of the module, and in any further modules that you study that 
involve calculus. So it’s important that you become proficient in them. 
The only way to do that is to practise, so you should aim to do all the 
activities in this unit, and as many further activities in the practice quizzes 
and exercise booklets as you have time for, until you feel confident with 
your skills. 


In the first half of the unit you'll also learn how to use your computer to 
find and work with derivatives, and you'll see some further applications of 
differentiation. 


In the second half of the unit you'll start to learn about integral calculus, 
which can be thought of as the reverse of differential calculus. You’ve seen 
that in differential calculus you start off knowing the values taken by a 
continuously changing quantity throughout a period of change, and you 
use this information to find the values taken by the rate of change of the 
quantity throughout the same period. In integral calculus, you do the 
opposite: you start off knowing the values taken by the rate of change of a 
quantity throughout a period of change, and you use this information to 
find the values taken by the quantity throughout the same period. This 
process is called integration. The topic of integral calculus continues in 
Unit 8. 


1 Derivatives of further standard 
functions 


In this section you'll meet the formulas for the derivatives of the five 
standard functions sin, cos, tan, exp and In. 


Unit 7 Differentiation methods and integration 


1.1 Derivatives of sin, cos and tan 


In this first subsection we’ll obtain formulas for the derivatives of the 
trigonometric functions f(x) = sina, f(x) = cosx and f(x) = tana. 


When youw’re working with these functions, it’s essential to remember that 
the input variable x is the size of an angle measured in radians. This is 
indicated by the formulas for the functions, because x represents a 
number, and when the size of an angle is given just as a number, with no 
units, then the units are assumed to be radians. You met this convention 
in Unit 4. For example, sin 10 denotes the sine of 10 radians, whereas 

sin 10° denotes the sine of 10 degrees. 


When we’re dealing with the derivatives of trigonometric functions, we 
always measure angles in radians. This is because it makes the formulas 
for the derivatives of trigonometric functions simpler. If you’d like to know 
more about this, then, once you’ve completed Sections 1 and 2 of this unit, 
read the document Why do we use radians? on the module website. 


Derivative of sin 


Our first aim in this subsection is to obtain a formula for the derivative of 
the sine function, f(x) = sina. 


As you know, the derivative of a function is itself a function, and therefore 
the derivative has a graph. So, to get a rough idea of what the derivative 

of the sine function might be like, let’s try to roughly sketch its graph, by 
estimating the gradient of the graph of the sine function at various points. 


Figure 1 shows the graph of the sine function, drawn on axes with equal 
scales, and some tangents to the graph. 
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Figure 1 The graph of f(x) = sin x 


When «x = 0, the gradient of the graph of the sine function seems to be 
about the same as the gradient of the line y = x; that is, it seems to be 
about 1. As x increases, the graph gradually gets less steep — that is, its 
gradient gradually decreases, until x = 7/2, when the gradient seems to be 
zero. The gradient decreases slowly when zx is only a little larger than 0, 
but decreases more rapidly as x gets closer to 7/2. 


In other words, for x between 0 and 7/2 the value of the derivative seems 
to decrease from 1 to zero, slowly at first, but then more rapidly. So its 
graph must look something like the sketch in Figure 2. 
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Figure 2 A sketch of the graph of the derivative of f(x) = sin z, for x 
between 0 and 7/2 


Now look at the graph of the sine function for x between 7/2 and 7. The 
gradient starts at zero, then it becomes negative. At first the graph is not 
very steep — that is, the gradient has small negative values — but it 
becomes steeper and steeper — that is, the gradient takes negative values of 
greater and greater magnitude. Eventually, when x = 7, the gradient 
seems to be about the same as the gradient of the line y = —2; that is, it 
seems to be about —1. The gradient decreases fairly rapidly when x is only 
a little larger than 7/2, but decreases more slowly as x gets closer to 7. 


So, for x between 7/2 and 7, the value of the derivative seems to decrease 
from 0 to —1, fairly rapidly at first, but then more slowly. So the graph of 
the derivative must look something like the sketch in Figure 3. 


Figure 3 A sketch of the graph of the derivative of f(x) = sin z, for x 
between 0 and 7 


In the next activity you’re asked to extend the sketch in Figure 3 to cover 
more values of x. 


Activity 1 Extending the sketch of the graph of the derivative of sin 


By considering the gradients of the graph of the sine function (shown in 
Figure 1) for x between 7 and 37/2, and then for x between 37/2 and 2r, 
work out what the graph of the derivative of the sine function must look 
like for these values of x. Hence extend the sketch of the derivative of the 
sine function in Figure 3 to cover all values of x between 0 and 27. 


Unit 7 Differentiation methods and integration 


In Activity 1 you should have worked out that the graph of the derivative 
of the sine function for x between 0 and 27 must look something like the 
sketch in Figure 4. 
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Figure 4 A sketch of the graph of the derivative of f(x) = sin z, for x 
between 0 and 27 


Since the graph of the sine function repeats every 27 units, its gradients 
also repeat every 27 units. So you can now extend the sketch of the graph 
of the derivative to cover any interval of values of x that you wish. For 
example, Figure 5 shows a sketch of the graph for values of x between —7 
and 37. 


Figure 5 A sketch of the graph of the derivative of f(x) = sin z, for x 
between —7 and 37 


The graph in Figure 5 should look familiar to you — it looks very like the 
graph of the cosine function! So it seems possible that the derivative of the 
sine function is the cosine function — in other words, that 


if f(x) = sing, then f'(x) = cos zx. 


This is indeed true. It can be confirmed by using differentiation from first 
principles, but this is quite tricky to do. If you’d like to see some details 
about how it’s done, then have a look at the document The derivative of 
the sine function on the module website. 
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Derivative of cos 


Let’s now look at obtaining a formula for the derivative of the cosine 
function, f(a) = cosa. You could get a rough idea of what this derivative 
must be like by using a similar procedure to the one that you’ve just seen 
for the sine function. Figures 6 and 7 show the graph of f(x) = cosa, and 
the sketch of its derivative that’s obtained from this procedure. 


Figure 7 A sketch of the graph of the derivative of f(x) = cos x 


The graph in Figure 7 looks like the graph of the sine function, reflected in 
the x-axis. That is, it looks like the graph of y = —sinz. So it seems 
possible that the derivative of the cosine function is the negative of the 
sine function — in other words, that 


if f(x) = cosg, then f'(x) = -— sinz. 


Again, this is indeed true. One way to confirm it is to use differentiation 
from first principles, but, as with the sine function, this is a tricky process. 


A simpler way to confirm it is to deduce it from the fact that the 
derivative of the sine function is the cosine function. As you know, you can 
obtain the graph of the cosine function by translating the graph of the sine 
function by 7/2 to the left. It follows that you can obtain the graph of the 
derivative of the cosine function by translating the graph of the derivative 
of the sine function by 7/2 to the left. (This is because the two derivatives 
give the gradients of the two graphs.) 
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The derivative of the sine function is the cosine function, whose graph is 
shown in Figure 8. Translating this graph by 7/2 to the left gives the 
graph in Figure 9. Because of the symmetries of the graphs of sine and 
cosine, this graph is the graph of y = — sin z. So the derivative of 

f(x) = cosa is indeed f'(x) = —sinz. 


Figure 9 The graph of y = cos x translated by 7/2 to the left 


Derivative of tan 


The final derivative of a trigonometric function that you’ll meet in this 
subsection is the derivative of the tangent function. In fact, 


if f(z) = tana, then f'(x) = sec? z. 


Remember that sec? z means (sec x)’, and also that seca means 1/(cos x). 
So another way to write the derivative of f(x) = tan z is 


(Ge 


(cos x) 
However, we usually use the form f'(x) = sec? x, for conciseness. 


This formula for the derivative of the tangent function can be worked out 
by using the fact that tan x = (sinx)/(cosx), together with the formulas 
for the derivatives of f(x) = sinz and f(x) = cosx. The process involves a 
rule for combining derivatives that you’ll meet later in the unit, and you’ll 
see the justification of the formula there. 

For now, you might like to compare the graphs of y = tan x and y = sec? z, 
shown in Figure 10, and convince yourself that the graph of y = sec? x 


does seem, at least roughly, to give the gradients of the graph of y = tan z. 
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In particular, notice that the gradients of the graph of y = tan x are always 
positive and that the graph of y = tan z is the least steep whenever zv is a 
multiple of 7. 
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Figure 10 The graphs of (a) y = tan x (b) y = sec? x 


Here’s a summary of the three derivatives that you’ve met in this 
subsection. 


Derivatives of sin, cos and tan 


d 2 
= a = cos == (cos2) = =g = (amaze) = e a 
(sinz) = cosa T (cos x) ina T ( ) 


dx 


In the next activity you’re asked to use these formulas, together with the 
sum and constant multiple rules, to find the derivatives of some more 
functions. In this activity, and in general, remember that if a function is 
specified using function notation, then you should usually use Lagrange 
notation for its derivative, whereas if it’s specified by an equation that 
expresses one variable in terms of another, then you should usually use 
Leibniz notation. 


Activity 2 Differentiating functions involving sin, cos and tan 


Write down the derivatives of the following functions. 
(a) f(x) = sin x + cos x (b) glu) = u? — cos u (c) P=6tané 
(d) r = —2(1 + sin ¢ġ) 
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1.2 Derivatives of exp and In 


In this subsection you'll meet the formulas for the derivatives of the 
exponential function, f(a) = e”, and its inverse function, the natural 
logarithm function f(a) =Inaz. Remember that e is a special irrational 
constant, whose value is about 2.718. 


Derivative of exp 


Let’s look first at the exponential function, f(x) = e”. Its graph is shown 
in Figure 11. 
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Figure 11 The graph of y = e” 


You saw in Unit 3 that the constant e has the special property that the 
graph of f(x) = e” has gradient exactly 1 at the point with x-coordinate 0, 
as shown in Figure 12. So you already know one value of f'(x) for this 
function f: you know that f’(0) = 1. 
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Figure 12 The graph of y = e” has gradient 1 when x = 0 


1 Derivatives of further standard functions 


You can find the general formula for the derivative f'(x) of the function 
f(x) = e” by using differentiation from first principles. The difference 
quotient for f(x) = e” is 

f(et+h)— fe) _ ete? 


h h 


As usual, to find a formula for f'(x), you have to work out what happens 
to the value of the difference quotient as h gets closer and closer to zero. 
It’s not immediately obvious how to do this, but surprisingly you can do it 
by using the fact that you already know what happens in the particular 
case when x = 0, since you know that f’(0) = 1. 


When «x = 0, the difference quotient is 
Oth _ e? eh _j 


h hR 
So you already know that, as h gets closer and closer to zero, the value of 
this expression gets closer and closer to 1. 


Now look again at the difference quotient for a general value of x. You can 
rearrange it as follows: 


To eae i e—1 
E ak "Nn 


The expression in the brackets is the difference quotient in the case when 
x = 0. So as h gets closer and closer to zero, the expression in the brackets 
gets closer and closer to 1, and hence the value of the whole expression 
gets closer and closer to e”. That is, the formula for the derivative is 


Ta =e 


So the derivative of the exponential function is the exponential function! 


To see that this makes sense, look at the graph of the exponential function 
in Figure 11. As x increases, the gradient of the graph increases, and it 
increases more and more rapidly. So as x increases, the value of the 
derivative of the exponential function increases, and it increases more and 
more rapidly. The exponential function itself has the property that as x 
increases, the value of the function increases, and it increases more and 
more rapidly. 


The exponential function, and its constant multiples such as the function 
f(x) = 3e”, are the only functions that are equal to their own derivatives 
(except, of course, for functions obtained by restricting the domains of 
these functions). It’s this property that makes the constant e such a 
special number. 


11 
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Derivative of In 


Now let’s look at the natural logarithm function, f(x) = Ina. Since this 
function is the inverse of the exponential function f(x) = e”, its graph is 
the reflection of the graph of f(x) = e” in the line y = x, as shown in 
Figure 13. 


Figure 13 The graphs of y = ln z and y = e” 


So the gradients of the two graphs are related, and hence you can use the 
formula for the derivative of f(x) = e” to work out a formula for the 
derivative of f(x) = Ina. If you’d like to see this done, then have a look at 
the document The derivative of the natural logarithm function on the 
module website. 


However, a quicker way to work out the derivative of the natural logarithm 
function is to use a general rule, called the inverse function rule, that 
you'll meet later in the unit. This rule tells you how to use the formula for 
the derivative of any invertible function to work out a formula for the 
derivative of its inverse function. When you meet this rule, you'll see that 
the formula for the derivative of f(x) = ln x turns out to be 


1 

Fe) ==. 
For now, you might like to compare the graphs of y = ln z and y = 1/az, 
shown in Figure 14, and convince yourself that the function y = 1/x does 
seem to give the gradients of the graph of the function y = ln x. Only the 
right-hand half of the graph of y = 1/x is shown. The left-hand half 
doesn’t give gradients of the graph of f(x) = ln x, because In x is defined 
only for x > 0. 


1 Derivatives of further standard functions 


Figure 14 The graphs of (a) y = Inz (b) y = 1/z (x > 0) 


The two derivatives that you’ve met in this subsection are summarised in 
the box below. 


Derivatives of exp and In 


d 


pes d o1 
T le =e gez 


dr 


In the next activity you’re asked to use these formulas, together with the 
sum and constant multiple rules and the formulas for the derivatives of 
trigonometric functions, to find the derivatives of some more functions. 


Activity 3 Differentiating functions involving exp and log 


Find the derivatives of the following functions. 
(a) f(z) =e*4+1nz (b) h(r) =r — cosr — 3lnr 


(c) v=i+hnt (d) w = 5- 3e” (e) k = 4(lnv — tan v) 


The formulas for the derivatives of standard functions that you’ve met in 
this section are all included in the Handbook. However, it’s worth 
memorising the formulas for the derivatives of sin, cos, exp and ln, at least, 
as they occur frequently. 
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2 Finding derivatives of more 
complicated functions 


In this section you'll learn some more ways in which you can combine 
formulas for the derivatives of functions to obtain formulas for the 
derivatives of other, more complicated functions. 


2.1 Product rule 


Suppose that you know the formulas for the derivatives of two functions, 
and you want to know the formula for the derivative of their product. 
(Remember that the product of two functions is obtained by ‘multiplying 
them together’.) For example, you already know the formulas for the 
derivatives of the functions f(x) = x? and g(x) = sin z, but suppose that 
you want to know the formula for the derivative of the function 

k(x) = xz? sin z. 


You’ve seen that to obtain a formula for the derivative of the sum of two 
functions, you just add the formulas for the derivatives of the two 
individual functions — this is the sum rule. So you might hope that to 
obtain a formula for the derivative of the product of two functions, you just 
multiply the formulas for the derivatives of the two individual functions. 


Unfortunately, this doesn’t work! For example, consider the two power 
functions f(x) = x? and g(x) = x3. The derivatives of x? and z? are 2x 
and 3x”, respectively, and multiplying these two derivatives gives 6x3. 
However, the product of the functions f(a) = x? and g(x) = x? is the 
function k(x) = x°, and you already know that the derivative of 2° is 5x4. 
So multiplying the formulas for the derivatives of the two individual 
functions doesn’t give the right answer for the derivative of their product. 


In fact, to find a formula for the derivative of the product of two functions, 
you need to use the rule in the box below. It’s given in Lagrange notation, 
but you'll see it in Leibniz notation shortly. 


Product rule (Lagrange notation) 
If k(a) = f(x)g(a), then 
k'(x) = f(z)g'(x) + 9(x) f'(2), 
for all values of x at which both f and g are differentiable. 


The product rule can be proved by using the idea of differentiation from 
first principles, and you'll see this done at the end of this subsection. 


Your main task in this subsection is to learn how to use the product rule. 
The next example demonstrates how it can be used to differentiate the 


2 Finding derivatives of more complicated functions 


2 


function k(x) = x 
section. 


sin x, which was mentioned at the beginning of this 


An alternative way to write the final answer in Example 1 is as 
k'(x) = x(x cosx + 2sin x). Either form is acceptable. 


Here are some similar examples for you to try. 


Activity 4 Using the product rule 


Use the product rule to differentiate the following functions. 
(a) k(x) = xz? tanz (b) k(x) = ze” 


Activity 5 Using the product rule again 
Check that if you find the derivative of the function k(x) = x° by writing 


x5 = x? x z? and using the product rule, then you obtain the same answer 
as you obtain by differentiating it directly. 
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The product rule can be written in Leibniz notation as follows. 


Product rule (Leibniz notation) 


If y = uv, where u and v are functions of x, then 


dy _ ay du 
dz “de da’ 


for all values of x at which both u and v are differentiable. 


(The phrase ‘u and v are differentiable’ in the box above is a condensed 
way of saying that if we write u = f(x) and v = g(x) then f and g are 
differentiable at x.) 


However it’s written, in essence the product rule tells you the following: 


derivative derivative derivative 

first second 

of product | = À x | of second | + ; of first 
function function 


function function function 


For most people, the best way to remember and use it is as the 
abbreviated informal version below. 


Product rule (informal) 


derivative derivative derivative 
o a SHS & pee) telecoms ( of first ) ` 


When you use this informal version of the product rule, you can usually 
skip the steps of writing down the first and second functions and their 
derivatives, and just go straight to applying the rule. To do this, you recite 
the informal version of the rule in your head: as you think ‘first’, you write 
down the formula for the first function, then as you think ‘derivative of 
second’, you differentiate the second function in your head and write down 
the result, and so on. This is illustrated in the next example. 


Cc) Example 2 Using the product rule efficiently 
— Differentiate the function 


k(x) = (e e aa 
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2 Finding derivatives of more complicated functions 


There are various ways to write the final answer in Example 2. For 
example, the following are all acceptable: 


2z(1+Inz)+1 


1 1 
2+—4+2Inz, —+2(1+Inz) and 
z ki x 


You can practise using the informal version of the product rule in the next 
activity. Alternatively, you might prefer to continue using the formal 
version for a little longer, and move to the informal version when you feel 
more confident with it. You can use the exercises on the product rule in 
the Unit 7 practice quiz and exercise booklet for further practice. 


Whichever version of the product rule you use, remember to simplify your 
final answers, where possible. You should always do this when you 
differentiate functions. 


Activity 6 Using the product rule efficiently 


Use the product rule to differentiate the following functions. 

(a) k(x) = (3z? +2x + 1)e” (b) k(x) = sin z cos x 

(c) k(z)=zsinz (d) v= (2t? —1)cost (e) m = (u? +3)nu 
(£) y= yzsinz 


Hint: In part (f), convert the square root symbol to index notation before 
you apply the product rule. 
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Activity 7 Using the product rule to find a gradient 


Find dy/dx when y = z?e”. What is the gradient of the graph of y = xe” 
at the point with x-coordinate —1? 


To finish this subsection, here’s a proof of the product rule, using 
differentiation from first principles. It uses the Lagrange notation form of 
the product rule, which is repeated below. 


Product rule (Lagrange notation) 
Lt h(a) = fe) g(a), then 
k(x) = f(x)g'(x) + g(x) f (2), 


for all values of x at which both f and g are differentiable. 


A proof of the product rule 


Suppose that f and g are functions, and that the function k is given by 
k(x) = f(x)g(a). Let x denote any value at which both f and g are 
differentiable. To determine whether k’(x) exists, and find it if it does 
exist, you have to consider what happens to the difference quotient for k 
at x, which is 
k(x +h) — k(x) 
h 
(where h can be either positive or negative, but not zero), as h gets closer 
and closer to zero. Since k(x) = f(x)g(z), this difference quotient is equal 
to 
f(x + h)g(a + h) — f(x) g(@) 
7; . 
You can rearrange this expression so that it includes the difference 
quotients for f and g at x. To do this, you first subtract and add 
f(x)g(x +h) in the numerator. Then the expression becomes 


F(x + h)g(z +h) - f(z)g(a +h) + fla)g(@ +h) - f(x) g(a) 
h ? 
which is equal to 


(ZO) se n) + fo) (o, 


The expression in the first pair of large brackets is the difference quotient 
for f at x, and the expression in the second pair of large brackets is the 
difference quotient for g at x. So as h gets closer and closer to zero, the 
values of these two expressions get closer and closer to f'(x) and g'(x), 
respectively. Also, the value of g(x + h) gets closer and closer to g(x). 


2 Finding derivatives of more complicated functions 


Hence the whole expression gets closer and closer to f’(x)g(x) + f(x)g'(x); 
that is, f(x)g'(x) + g(x) f'(x). So 


k(x) = f(x)g'(x) + g(x) f (2), 


which is the product rule. 


2.2 Quotient rule 


Suppose now that you know the formulas for the derivatives of two 
functions, and you want to know the formula for the derivative of their 
quotient. (Remember that a quotient of two functions is obtained by 
‘dividing one by the other’.) For example, you already know the formulas 
for the derivatives of the functions f(x) = x? and g(x) = sin z, but suppose 
that you want to know the formula for the derivative of the function 

k(x) = x?/ sinz. 


You can find the derivative of the quotient of two functions by using the 
rule in the box below. It’s given in Lagrange notation, but you'll see a 
version in Leibniz notation shortly. 


Quotient rule (Lagrange notation) 
If k(x) = f(2)/g(2), then 
n — G2) F(x) = F(a)g'(2) 
f=" oer 


for all values of x at which both f and g are differentiable and 


g(x) # 0. 


You'll see a justification of the quotient rule in Subsection 2.5. Here’s an 
example that illustrates its use. 


© 
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There are various ways to write the final answer in Example 3. For 
example, the following are all acceptable: 


a(2sinz—acosz) 2x xcosxr 2x—-2? cotx 


sin? x > sing sin? x ’ sin x 


Here are two similar examples for you to try. 


Activity 8 Using the quotient rule 


Use the quotient rule to differentiate the following functions. 


(a) kay==(b) kla) = À 


x 
2r +1 


The quotient rule can be written in Leibniz notation as follows. 


Quotient rule (Leibniz notation) 


If y = u/v, where u and v are functions of x, then 


du 7 dv 
dy _ "de “de 
dz v2 i 


for all values of x at which both u and v are differentiable and v Æ 0. 


For most people the best way to remember and use the quotient rule is as 
the informal version below. 


2 Finding derivatives of more complicated functions 


Quotient rule (informal) 


(orton Gog n o 


derivative \ _ of top of bottom 
of quotient ) (bottom)? 


You can use this informal version of the rule in a similar way to the 
informal version of the product rule. To differentiate a quotient of two 
functions by using the quotient rule, you recite the informal version of the 
rule in your head: as you think ‘bottom’, you write down the formula for 
the bottom function, then as you think ‘derivative of top’ you differentiate 
the top function in your head and write down the result, and so on. (Make 
sure that you remember that the quotient rule contains a minus sign rather 
than a plus sign.) This is illustrated in the next example. 


Example 4 Using the quotient rule efficiently 


Differentiate the function 


2x + 1 
He Ina ` 


Solution 


@. Identify the top and bottom functions: here top = 2x + 1 and 
bottom = ln z. Use the rule 


(bottom) x derivative en derivative 
derivative \ _ of top P ee r 
of quotient} oa d 


By the quotient rule, 


(lng) x 2— (2x + 1) x 


(iam) 


S| = 


k'(x) = 
@. Simplify the answer. To clear the fraction in the numerator, 
multiply both the numerator and the denominator by z. ® 


1 
2xlng — (2r +1)x -xz 
a 


oma 


2e Ina — 227-1 
aoine) 
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You can practise using the informal version of the quotient rule in the next 
activity. Alternatively, as with the product rule, you might prefer to 
continue using the formal version for a little longer, and move to the 
informal version when you feel more confident with it. 


Activity 9 Using the quotient rule 


Use the quotient rule to differentiate the following functions. 


(e) f@)=F bowW=* © 2-4 
@) m=, © => 


In Subsection 1.1 you met the fact that the derivative of tan x is sec? x, but 
this result wasn’t justified there. It can be confirmed by using the quotient 
rule, as shown in the next example. (Note, however, that this justification 
is incomplete, as you haven’t yet seen a proof of the quotient rule!) 


Example 5 Using the quotient rule to find the derivative of tan 


Show that the derivative of tan x is sec? x. 


Solution 
@. Use the fact that tan xz = (sinx)/(cosx), and apply the quotient 
rule. ® 


By the quotient rule, 


a (tanz) = aah eu 
dx ~ dx \cosx 


_ (cos x)(cos x) — (sin x)(—sin x) 
a (cos x)? 

2 2 
_ cos” T + SINT 


cos? x 
@. Simplify this expression by using the facts that cos? x + sin? x = 1 
and 1/cosxz = secz. 4 
nl 
cos? T 
= sec’ x, 


as required. 


2 Finding derivatives of more complicated functions 


You can also use the quotient rule to find the derivatives of the 
trigonometric functions cosec, sec and cot. These derivatives are given in 
the box below, and you’re asked to confirm them in the next activity. 


Derivatives of cosec, sec and cot 


T (cosec x) = — cosec x cot x 
I 

d (sec x) = sec g ta 

— x) = sec v tan z 

dz 

d 2 

— (cot x) oe x 

qz (oot) 


Activity 10 Finding the derivatives of further trigonometric 
functions 


Use the quotient rule to confirm the following formulas for derivatives. 


d = 2 d = 
(a) = (cot x) = — cosec* x (b) T (sec x) = sec z tan x 


(o) e t 
Cc — (cosec T) = — Cosec T cott 
dz 


Hint: Use the facts that cot x = (cos x)/(sinx), sec x = 1/(cosx) and 
cosec x = 1/(sin x). 


The derivatives of cosec, sec and cot can now be included in the list of 
derivatives of standard functions that you have available to work with. 
The Handbook includes a table of all the standard derivatives given in this 
module. 


2.3 Chain rule 


Now that you know the product rule and the quotient rule, you can 
differentiate a much wider range of functions than you could before. 
However, there are still some fairly simple functions that you can’t 
differentiate using any of the rules that you’ve met so far. 


For example, consider the function k(x) = sin(x?). This function isn’t a 
sum, a product or a quotient of functions that you can differentiate. 
However, it’s still made up of functions that you can differentiate, just 
combined in a different way. To work out k(x) for a particular value of x, 
you have to first square x, then find the sine of the result. So the 
function k is a composite of the functions f(x) = x? and g(x) = sin z, as 
shown in Figure 15. You met the idea of a composite of two functions in 
Subsection 3.2 of Unit 3. 
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Figure 15 A composite of the functions f(x) = x? and g(x) = sin x 


The functions f(x) = x? and g(x) = sin z are functions that you can 
differentiate. 


In this subsection you’ll meet another rule for differentiation, called the 
chain rule, which allows you to differentiate functions that are composites 
of functions that you can already differentiate. You'll see towards the end 
of the subsection why it’s called the chain rule. It’s also sometimes called 
the composite rule or the function of a function rule. 


The first step in learning to use the chain rule is to learn to recognise when 
a function is a composite of two functions, and how to decompose such a 
function into its constituent functions. (Decomposing is the opposite of 
composing.) To see how to do this, let’s consider again the function 


k(x) = sin(«?). 
While you’re learning about decomposing functions, it’s easier to work 
with functions expressed using two variables rather than function notation, 
so let’s first express this function as 
y = sin(z?). 
To recognise that it’s a composite of two functions, you observe that it’s of 
the form 
y = sin(something), 
where the ‘something’ is an expression that contains the input variable zx. 
You can then decompose it into its two constituent functions. 
A convenient way to do this is to set the ‘something’ equal to an extra 
variable, say u. This gives 
y=sin(u) where u=a2?. 
These two equations specify the two constituent functions. 


In general, suppose that you have an equation that specifies y as a function 
of x. If the equation is of the form 


y = a function of ‘something’, 


where the ‘something’ is a function of the input variable x, then the 
original function is a composite function. To decompose it into its two 
constituent functions, you set the ‘something’ equal to u. In other words, 
you write y as a function of u, where u is a function of z. 


Here are some more examples that illustrate how to do this. 


2 Finding derivatives of more complicated functions 


Here are some examples for you to try. 


Activity 11 Decomposing composite functions 


For each of the following equations, write y as a function of u, where u is a 
function of a. 


(a) y = In(2*) (b) y = sin(x? — 1) (c) y = cos(e®) 
(d) y= (y= {N y= Vint 
(@) v=(e@+ ve) (h) y=} (i) y= evë 


The chain rule, which allows you to differentiate composite functions, is 
given below. It’s given in Leibniz notation, because it’s easier to remember 
in this form, but you'll see it in Lagrange notation later in this subsection. 


Chain rule (Leibniz notation) 


If y is a function of u, where u is a function of x, then 
dy dy du 


dr du dr’ 


for all values of x where y as a function of u, and u as a function of x, 
are differentiable. 
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As you can see, in Leibniz notation the chain rule looks like an obvious 
statement about fractions, which makes it easy to remember. You should 
keep in mind, though, that this is not what it is! The pieces of notation 
dy/dx, dy/du and du/dz represent derivatives, not fractions. 


As with the other rules for combining derivatives, the chain rule can be 
proved by using the idea of differentiation from first principles. 
Unfortunately, this is quite tricky to do, and the proof isn’t included in 
this module. 


However, here’s an informal way to see that the chain rule makes sense. 
Suppose that y is a function of u, where u is a function of x, as in the 
statement of the chain rule. Suppose that for a particular value of x, the 
value of du/dz is 2, and for the value of u that corresponds to this 
particular value of x, the value of dy/du is 3. Since y is increasing at the 
rate of 3 units for every unit that u increases, and u is increasing at the 
rate of 2 units for every unit that x increases, you’d expect y to be 
increasing at the rate of 3 x 2 = 6 units for every unit that x increases. 
This is what the chain rule tells you. 


The next example shows you how to use the chain rule. 


Example 7 Using the chain rule 
Differentiate the function y = sin(2?). 
Solution 


@. Recognise that the given function is a function of ‘something’, 
where the ‘something’ involves the input variable x. Hence decompose 
the given function. © 


Here y = sin u where u = 2”. 


@®. Find the derivatives of the constituent functions. & 


This gives 
dy du 
ee E = D, 
T g 
d dy d 
@. Apply the chain rule, = = = = 4 


By the chain rule, 


—_ (cosu) x 2x 


dx 


@. Express the answer in terms of x, by substituting for u in terms 
of x. Here we have u = 2”. # 


= (cos(x”)) x 2x 


2 Finding derivatives of more complicated functions 


@. Simplify the answer. ® 


= 2x cos(x’). 


Notice that when you apply the chain rule as illustrated in Example 7, you 
first obtain an answer that’s expressed in terms of both u and x. You then 
have to substitute for u, using the equation for u in terms of x that you 
wrote down when you decomposed the function, to obtain a final answer in 
terms of x. 


Activity 12 Using the chain rule 


Differentiate the following functions. 


(a) y= (b) y=sin(2a2—3) (c) y=sin () 


(d) y =cos(e”) (e) y = ln(sin x) (£) y = sin? z 


Hint: In part (f), remember that sin? x means (sin x)’. 


You’ve now seen how to apply the chain rule by first introducing an extra 
variable, usually u, which you then remove again to obtain the final 
answer. Once you’re familiar with using the chain rule, you can often 
apply it without introducing this extra variable at all. 


To see how to do this, consider again the equation y = sin(x?). Think of 
the expression that you would have written as u, that is, the expression 
that was referred to as ‘something’ earlier, as one ‘big variable’: 


y = sin((x’)) 


To find dy/dz, first differentiate with respect to this big variable; that is, 
with respect to the ‘something’. This gives 


cos( (x?) ) 


By the chain rule, to obtain the final answer, you now need to multiply by 
the derivative of the something with respect to x. This gives 

d 
a cos(x?) x 2x. 


da 
This is the final answer, but, as before, you should simplify it slightly to 
give 

d 
a2 = 9% cos(a”). 


dx 
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Here are two more examples. Notice that in the second of these examples 


the function is expressed using function notation rather than using two 
variables. This makes no difference to how you apply the ideas above. 


Example 8 Using the chain rule without introducing an extra 
variable 

Differentiate the following functions. 

(a) y=ln(2x-1) (b) k(x) = costz 

Solution 

(a) @ Think of the ‘something’, 2x — 1, as a ‘big variable’. 


Differentiate with respect to this variable. Then multiply by the 
derivative of 2x — 1 with respect to x. @ 


By the chain rule, 


d 1 
T - m=i ne 
@. Simplify the answer. ® 
2 
EE 


(b) &. The function is k(x) = (cosx)*, so think of the ‘something’, 
cos x, as a ‘big variable’. Differentiate with respect to this 
variable. Then multiply by the derivative of cos x with respect 
tor. @ 


By the chain rule, 
k'(x) = (4cos? x)(— sin x) 
@. Simplify the answer. @ 


= —Asin g cos? g. 


Try this quicker way of using the chain rule in the following activity. 


Activity 13 Using the chain rule without introducing an extra 
variable 


Differentiate the following functions. 

(a) f(x) = cos(4x) (b) f(x) = eti (c) r(x) = co x 
(da) g(x) = In(2? +241) (e) u(t) = sin(e’) (£) v=In(r°) 
(g) r = tan? 0 (h) R= e4 (i) A=(1+ 77)? 


2 Finding derivatives of more complicated functions 


You should try to get used to applying the chain rule in the quick way 
demonstrated in Example 8, as you'll find this skill useful in calculus. 
However, sometimes, if a function that you want to differentiate using the 
chain rule isn’t straightforward, then you might find it easier to revert to 
introducing an extra variable. It’s fine to do that if you find it helpful. 


Sometimes you can make it easier to apply the chain rule without 
introducing an extra variable if you start by rearranging the formula for 
the function. This applies in particular to composites that involve power 
functions. Here’s an example. 


Example 9 Applying the chain rule to a composite involving a A 
power function —_ 


Differentiate the function 


1 
a 
Solution 


®. The function f is of the form f(x) = 1//something, where the 
‘something’ is sin x. So use the chain rule. Start by rearranging the 
formula for f into the form ‘something”’. ® 


The function is 


1 
bby) = ; 
I ) ysin x 
that is, 


f(z) = (sin x) t. 
@. Apply the chain rule. ® 
By the chain rule, 
f'(x) = —4(sin «)~3/?(cos x) 


COS £ 


2sin3/? x 


With practice, you may be able to do the sort of initial rearranging needed 
in Example 9 in your head. 
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Activity 14 Applying the chain rule to composites involving power 
functions 


Differentiate the following functions. 
i 
(a) f(x) = vlnz (b) p(w) = (w3 +74 


1 1 
(d) f(x) = (©) a) = Garg 


Now here’s an explanation of where the name ‘chain rule’ comes from. 
Consider a function that’s a composite of three functions. Suppose that 


y is a function of u, 
u is a function of v, 
v is a function of z. 


You can find the derivative of y with respect to x by applying the chain 
rule twice: 


dy dy du du du dv 
d£ du dx dx dv dx 
so, using the second equation to substitute in the first equation, you obtain 
dy dy du du 
dx du dv da’ 


As you can see, this gives a ‘chain’ of derivatives. You can keep adding 
‘links in the chain’ indefinitely. One of the examples in Subsection 2.5, 
Example 12, illustrates how you can apply the chain rule twice in this way. 


Finally in this subsection, let’s translate the chain rule into Lagrange 
notation. You can do this by thinking about the way of applying it 
without introducing an extra variable. Suppose that f and g are functions, 
and you want to differentiate the expression g(f(x)) with respect to x. 
The chain rule says that you can do this by differentiating it with respect 
to the ‘something’ f(x), then multiplying the result by the derivative of 
f(x) with respect to x. This gives the version of the chain rule below. 


Chain rule (Lagrange notation) 
If k(x) = g(f(x)), where f and g are functions, then 
k'(x) = g'(f(2)) f'(x) 


for all values of x such that f is differentiable at x and g is 
differentiable at f(x). 


2 Finding derivatives of more complicated functions 


The first use of the chain rule seems to be by Gottfried Wilhelm 
Leibniz in a memoir of 1676, in which he calculated (with mistakes!) 
the derivative of Va + bz + cz*. Further examples of the use of the 
rule appear in the first calculus textbook, written by Guillaume de 
l'Hôpital and published in 1696, but nowhere is the rule actually 
stated. (There is more about |’H6pital’s book in Subsection 3.4.) It 
was not until the 19th century that the chain rule was commonly 
stated explicitly, and at that time it was generally described as a ‘rule 
for differentiating a function of a function’. The term chain rule seems 
to have originated as a translation of the German word Kettenregel, 
which was used in German calculus books in the early 20th century. 


2.4 Differentiating functions of linear expressions 


This subsection is about a type of function that occurs frequently, and 
which you can differentiate by using the chain rule. 
Consider the functions 

k(x) =sin(32) and h(x) =e7*. 
Each of them is a composite function, and in each of them the ‘something’ 
is a constant multiple of the input variable x; the ‘somethings’ are 3x 
and —2z2, respectively. It’s worth noticing, and remembering, what 


happens when you use the chain rule to differentiate functions like these. 
For example, 


if k(x) = sin(3z), then k'(x) = cos(3x) x 3 = 3cos(3z), 
if h(x) = e7™, then W(x) = e7? x (—2) = —2e7”., 


You can see that to differentiate a composite function in which the 
‘something’ is az, where a is a constant, you differentiate with respect 
to ax, and then multiply by a. 


It’s worth remembering this special case of the chain rule, which can be 
stated as below. 


Derivative of a function of the form k(x) = f(az) 
If k(x) = f(ax), where a is a non-zero constant, then 
k'(x) = af'(aa), 


for all values of x such that f is differentiable at az. 


You can use this rule to quickly differentiate any function of the type 
discussed above. For example, it tells you that 


if k(x) = sin(5a), then k/ (x) = 5cos(5z). 


Try using the rule above in the next activity. 
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Activity 15 Differentiating functions of the form k(x) = f (ax) 


Differentiate the following functions. 


(a) k(x) = cos(7z) (b) r(@) = sin (5) (c) glu) = e84/? 


In fact you can slightly extend the special case of the chain rule above. 
Notice that, by the chain rule, 


if k(x) = sin(3x — 2), then k(x) = cos(3x — 2) x 3 = 3 cos(3x — 2), 

if h(x) = e™™®+t5, then Weyer x (—2) S0e 
You can see that, in general, to differentiate a composite function in which 
the ‘something’ is az + b, where a and b are constants, you differentiate 


with respect to ax + b, and then multiply by a. This special case of the 
chain rule is summarised below. 


Derivative of a function of the form k(x) = f(ax + b) 
If k(x) = f(ax +b), where a and b are constants with a non-zero, then 
k'(x) = af'(ax + 6), 


for all values of x such that f is differentiable at ax + b. 


For example, this rule tells you that 
if k(x) = sin(5x — 8), then k'(x) = 5cos(5x — 8). 


Try using it in the next activity. 


Activity 16 Differentiating functions of the form k(x) = f(ax + b) 


Differentiate the following functions. 


(a) k(x) = cos(7x + 4) (b) h(x) = e72- (c) r(@) = sin () 


Hasi (2°) 


2 Finding derivatives of more complicated functions 


Remember that an expression of the form ax + b, where a and b are 
constants, is called a linear expression in x. So in this subsection you’ve 
seen how to quickly differentiate functions of linear expressions. 


2.5 Using the differentiation rules together 


You can differentiate a wide range of functions by combining the various 
differentiation rules that you’ve met. For example, you can use more than 
one differentiation rule, or more than one application of the same rule. 
This subsection starts with three examples that show you the kinds of 
things that you can do. 


When you’re combining differentiation rules in this way, it can be useful to 
use the notation d/dz to indicate the derivative of a function that appears 
as part of your working, then actually do the differentiation in the next 
line. This is illustrated in the examples. 
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Example 11 Combining the differentiation rules again 


Find the derivative of 


Na =F : 
= e2+1) 


Solution 


@. The function is of the form (something)*. So apply the chain rule. 
The ‘something’ isn’t straightforward to differentiate, so use the 
notation d/dzx to indicate its derivative. © 


dy r?-—-2\? d /r-2 ; 
T = Al (==) x T PE (by the chain rule) 
@. Now differentiate (x? — 2)/(x? + 1). It’s of the form 
something/something, so apply the quotient rule. ® 


— : (= + uen) = = — an 


(by the quotient rule) 
@. Simplify the answer. ® 


ie = 2) (2x)(x? + 1-2? +2) 


rl (x? + 1)? 
ae (= — a . 6x 
toes (ee 
247x? — 2) 
a 


Example 12 Combining the differentiation rules yet again 


Find the derivative of 


g(t) =e T): 


Solution 


®. The function is of the form cos(something). So apply the chain 
rule. The ‘something’ isn’t straightforward to differentiate, so use the 
notation d/dt to indicate its derivative. © 


g ¢) = —sin(e"™) x = (e+!) (by the chain rule) 


2 Finding derivatives of more complicated functions 


@. Now differentiate ett. It’s of the form e®°™t"™, So apply the 
chain rule again. ® 


= —sin(e*t!) x e+! x 2 (by the chain rule again) 


@. Simplify the answer. ©& 


= —2¢%+! gin (e241), 


sin( 


You can practise combining the differentiation rules in the next activity. If 
you find it tricky to work out which rules you need to apply, then the 
checklist below might help. 


Checklist for differentiating a function 


IL. 
2 
3. 


Is it a standard function (is its derivative given in the Handbook)? 
Can you use the constant multiple rule and/or sum rule? 


Can you rewrite it to make it easier to differentiate? For 
example, multiplying out brackets may help. 


Is it of the form f(ax) or f(ax + b), where a and b are constants 
and f is a function? If so, use the rule for differentiating a 
function of a linear expression. 


Can you use the product rule (is it of the form 
f(x) = something x something)? 

Can you use the quotient rule (is it of the form 
f(x) = something/something)? 


Can you use the chain rule (is it of the form 
f(x) =a function of something)? 


When you use a differentiation rule, you usually have to find the 
derivatives of simpler functions. You can apply the checklist above to 
each of these simpler functions in turn. 


Note also that when you need to apply more than one rule to differentiate 
a function, the first rule that you should apply is the one that applies to 
the overall structure of the rule of the function. For example, as you saw in 
Example 10, the overall structure of the function f(x) = z? sin(«*) is that 


it’s a product, since x 


? sin(x?) is the product of z? and sin(x). So, to 


differentiate this function, you start by applying the product rule, ignoring 
for the moment the fact that one of the two functions in the product is 
itself a composite. After you’ve done that, you can deal with the 
composite. 
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Activity 17 Combining the differentiation rules 


Differentiate the following functions. 


(a) o()=(@ +1)" +1) (b) f@) = 


(c) g(x) = cos(x ln z) (d) A(x) = et/2 sin(32) (e) q(u) = 


(f) f(x) =e) (g) h(x) = cos%(e2) (h) 2(2)= 


(i) G(x) = 3sin z cos x + 2sin(3z) (j) p(t) = te sint 
sin yu 
yu 


(k) r = sin(20) cos 0 (1) h = ze?” (m) v= 


Sometimes you have to differentiate functions that involve constants 
represented by letters. You can do this in the usual way, treating the 
constants like any other numbers. 


Activity 18 Differentiating functions involving constants 


Differentiate the following functions. 
(a) g(x) = e-**, where k is a constant. 
(b) f(x) = z cos(rz). 
(c) h 

( 


d) y = cos(3x) + c, where c is a constant. 


= sin? (a0), where a is a constant. 


Here’s another technique that’s occasionally useful when you want to 
differentiate a complicated function. Whenever you have a function that 
you can differentiate using the quotient rule, an alternative way to 
differentiate it is to rearrange its rule to express it as a product of two 
functions, and then apply the product rule. 


In the next activity you’re asked to differentiate a function in this way. 
(You were asked to differentiate the same function using the quotient rule 
in Activity 9(e).) 


Activity 19 Differentiating a quotient by using the product rule 


Int 
Differentiate the function p(t) = > by writing it as p(t) = t~? Int and 


using the product rule. 


2 Finding derivatives of more complicated functions 


In fact, the quotient rule is really just a combination of the product rule 
and a special case of the chain rule. Here’s a proof of it, as promised in 
Subsection 2.2. 

A proof of the quotient rule 


Suppose that the function k has rule k(x) = f(x)/g(x). This rule can be 
written as 


k(x) = (g(x) f (2). 


So, for each value of x at which both f and g are differentiable and 


g(x) #0, 


which is the quotient rule. 


A proof of the formula for the derivative of a power function 


Finally in this subsection, here’s a proof of the general formula for the 
derivative of a power function, as promised in Unit 6. The formula states 
that, for any constant n, 
d 
dz 
The proof given here is valid only for x > 0, as you’ll see. However, for a 
power function y = x” that’s defined when x < 0, you can use the fact that 
the formula holds for x > 0, together with the symmetry of the graph of 
y = x”, to deduce that the formula also holds for æ < 0; the details are not 
given here. To see that the formula also holds when x = 0, you can use the 
fact that at x = 0 the gradient of the graph of y = x” is 0. 


an" 
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To prove the formula for x > 0, you can use the fact that x = e” for such 
values of x to write the expression x” in the following form: 


gr = (enn = ering 


This gives 
d ni __ d nlng 
Jz (x”) = T (e ) 


d 
= (en) T (nlng) (by the chain rule) 
x 
=(e"™*) xn x L (Ing) (by the constant multiple rule) 
z 
1 
— nlng = 
= {er"") xn x z 


l 
=g7"xXnx> 
x 


which is the formula stated above. 


Note that the two proofs that you’ve just seen, namely the proof of the 
quotient rule and the proof of the formula for the derivative of a power 
function, both depend on the chain rule. Since this module doesn’t include 
a proof of the chain rule, strictly the proofs of these two results are 
incomplete. 


3 More differentiation 


In this section you'll see how to use differentiation to solve a type of 
problem that occurs frequently in applications of mathematics, how to use 
differentiation on a computer, and how to find the derivative of the inverse 
of a function if you know the derivative of the original function. 


3.1 Optimisation problems 


In mathematics, and its applications, it’s often helpful to solve problems 
that involve identifying the best possible option from a choice of suitable 
possibilities. Problems of this type are known as optimisation problems. 


For example, the manufacturer of an electronic device might want to 
identify the best price that it should charge for its device, if its goal is to 
achieve the maximum profit. Increasing the price might increase the profit, 
but increasing it further might decrease the profit, by causing fewer of the 
devices to be sold. This problem is an example of a maximisation 
problem, because it involves identifying the circumstances under which the 
maximum value of a quantity is obtained. Similarly, some optimisation 


problems are minimisation problems, which involve identifying the 


circumstances under which the minimum value of a quantity is obtained. 


You can often solve optimisation problems by modelling them 
mathematically and then applying calculus. In particular, you can often 
use the strategy below, which you met in Unit 6, or an adaptation of it. 


Strategy: 
To find the greatest or least value of a function on an 
interval of the form [a, bj 


(This strategy is valid when the function is continuous on the interval, 
and differentiable at all values in the interval except possibly the 
endpoints.) 


1. Find the stationary points of the function. 


2. Find the values of the function at any stationary points inside the 
interval, and at the endpoints of the interval. 


3. Find the greatest or least of the function values found. 


The next example illustrates how you can use this strategy to solve an 
optimisation problem. You might like to try guessing the answer to the 
problem posed in the example, and then read on to see whether you’re 
right. 
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Solution 
@. Simplify the problem if possible. Draw a diagram. ® 


The volume of the channel is the area of its cross-section times its 
(fixed) length. So to achieve a channel of maximum volume, we have 
to achieve the maximum area of the cross-section. 


The diagrams below show the cross-sections of the metal sheet and the 
channel. We have to determine how far from the side the metal sheet 
should be bent to achieve the maximum area of the cross-section. 


@. Identify the quantity that you can change, and represent it by a 
variable, noting the possible values that it can take. Identify the 
quantity to be maximised, and represent it by a variable too. ® 


Let the distance from the side of the metal to where it is bent be x 
(in m). The value of x must be between 0 and 3. 


Let the area of the cross-section be A (in m?). 


@. Find a formula for the variable to be maximised (the dependent 
variable) in terms of the variable that you can change (the 
independent variable). Start by annotating the diagram. © 


Y 
o 
< > 
il 
> = > 
T 1-22 x il = Dr 


The area of the cross-section is given by 
A= z(1 — 22). 
@. Formulate the problem in terms of the variables. © 


We have to find the value of x, between 0 and 5; that gives the 
maximum value of A. 
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@. Use calculus to solve the problem. You can use the strategy in the 
box given before this example. ® 


The variable A is a polynomial function of x, so it is continuous on its 
whole domain and differentiable at every value of x. So the maximum 
value of A must occur either at an endpoint of the interval [0, 3], or at 
a stationary point. 


@. Find the stationary points. @ 


The formula for A is 


A = z(1 — 2x) 
=p = D 
so 
= = 1 — 4r. 
At a stationary point, dA/dæ = 0, which gives 
1 — 4r = 0; 
that is, 
= 


So the only stationary point occurs when x = $. 


@. Consider the values of the function at any stationary points inside 
the interval, and at the endpoints of the interval. ® 


The endpoints of the interval are 0 and 5; so the maximum value of A 
occurs when x = 0, 7 = ; On 2. 


When z = 0, A = 0, and similarly when z = 5, A=. 
When z = 4, 
A=] e259) SG S — 5 — 025. 


@. Find the greatest of the function values found. © 


The greatest of the three values of A found is 0.125, which is achieved 
when z = + 


So the maximum value of A is achieved when z = t. 


®@,. State a conclusion that answers the original problem. © 


To achieve a channel of maximum volume, the metal should be bent 
at a distance of 25cm from each side. 
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In fact, in Example 13 it isn’t necessary to actually work out the values 

of A when z = 0, z = t and z = 5: When x = 0 the height of the channel 
is zero, and when x = i its width is zero, so in both these cases the area A 
will be zero. When x = 7 the height and the width of the channel are 
both positive, so the area A will also be positive. So you can see 
immediately that, of these three values of x, the value z = ; will give the 
greatest value of A. 


el 


The solution to Example 13 shows that the channel has maximum volume 
when it is twice as wide as it is high. 


When yov’re solving a maximisation or minimisation problem, it can be 
helpful to sketch a graph of the function involved, or plot it using a 
computer. This will give you a rough idea of how the quantity that you 
want to maximise or minimise changes as the quantity on which it depends 
changes. For example, Figure 16 shows a graph of the equation 

A = z(1 — 2x) from Example 13. You can see that, as x increases, the 
value of A first increases and then decreases, as you’d expect, and the 
maximum value of A seems to occur when x is about 0.25, which accords 
with the answer found in Example 13. 


area, A (m?) 


01 02 03 O04 O05 
distance from side, x (m) 


Figure 16 The graph of A = z(1 — 2x) for 0 < z < 4 


Notice that in fact you could have solved the problem in Example 13 
without using calculus, because the function involved turned out to be a 
quadratic function. You could have found the value of x that gives the 
maximum value of A by using a method other than calculus to find the 
vertex of the parabola that’s the graph of the function. You saw some 
methods for doing this in Unit 2. However, once you’re familiar with 
calculus, it’s usually easier to use it. 


Each of the next two activities poses an optimisation problem for you to 
solve by using calculus. The first activity involves maximising the volume 
of a box, and the second one involves maximising the area of the 
cross-section of an open channel, in a similar way to Example 13. In both 
activities, you might like to try guessing the answers to the problems posed 
before you start work on the solutions. 


Activity 20 Solving an optimisation problem 


Suppose that you have a sheet of cardboard that measures 1m by 1m. 
You intend to cut a square from each of the four corners and then fold the 
resulting shape to form a box (without a top), as shown. What should the 
side length of the squares be, to give a box of maximum volume? 


lm 


1m 


In the next activity you should use the fact that the area of a triangle with 
sides of lengths a and b and included angle 0 (as illustrated in Figure 17) is 
given by the formula A = sab sin. You met this formula in Unit 4. 


Activity 21 Solving another optimisation problem 


Consider again the problem of the open channel in Example 13. Suppose 
that instead of bending the metal along two straight lines, you intend to 
bend it along one straight line in the middle, to form an open channel with 
a v-shaped cross-section, as shown below. What should be the angle 
between the two sides of the v-shape, to give a channel with maximum 
volume? 


lm 
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Figure 17 A triangle with 
sides of lengths a and b and 
included angle 0 
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diameter 


height 


Figure 18 A cylinder 
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Here’s a summary of the main steps that you can use to solve an 
optimisation problem. Of course, you should combine these steps with 
other problem-solving techniques as appropriate, such as simplifying the 
problem if possible, drawing a diagram and drawing a graph. 


Strategy: 
To solve an optimisation problem 


1. Identify the quantity that you can change, and represent it by a 
variable, noting the possible values that it can take. Identify the 
quantity to be maximised or minimised, and represent it by a 
variable. These variables are the independent and dependent 
variables, respectively. 


2. Find a formula for the dependent variable in terms of the 
independent variable. 


3. Use the techniques of differential calculus to find the value of the 
independent variable that gives the maximum/minimum value of 
the dependent variable. 


4. Interpret your answer in the context of the problem. 


The optimisation techniques that you’ve met in this subsection are 
useful in the design of everyday objects. For example, suppose that 
you want to design the shape of a food tin so that it has a certain 
volume but uses the minimum amount of metal. You can use calculus 
to determine the optimal shape of the tin. If you simplify the problem 
by assuming that the tin is simply a hollow cylinder, as shown in 
Figure 18, then it turns out that the optimal shape is for the diameter 
of the cylinder to be equal to its height. (You might like to try to 
prove this.) Food tins tend to have proportions close to this, though 
to optimise accurately the shape of a food tin you would need to take 
into account the extra metal needed for the joins, and any differences 
in the thicknesses of metal needed for the sides, base and top. 


3.2 Differentiation using a computer 

Sometimes it’s convenient to use a computer to find the derivatives of 
functions, and to work with these derivatives. The next activity shows you 
how to use the module computer algebra system to do this. 


Activity 22 Using a computer for differentiation 


Work through Section 8 of the Computer algebra guide. 
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To illustrate how computers can be used to solve calculus problems, in the 
rest of this subsection we’ll use a computer to solve the following 
minimisation problem. 


A man is in a forest, 1 km away from the nearest point on a straight 
road that passes through the forest, as shown in Figure 19. He wants 
to get to a point 2km further along the road, as quickly as possible, by 
first running in a straight line through the forest and then running 
along the road. He can run at 8 kilometres per hour through the 
forest, and at 16 kilometres per hour along the road. At what point 
along the road should he aim to join it? 


Figure 19 The position of a man in a forest and his path to his 
destination 


Before we can use a computer, we need to carry out the first two steps of 
the strategy given at the end of the last subsection, to convert the problem 
from a word problem into a problem expressed in terms of variables and a 
function. 


To do this, let x be the distance in kilometres between the point where the 
man joins the road and the point on the road that is closest to his initial 
position, as shown in Figure 20. Then x can take any value in the interval 
[0,2]. Let T be the total time in hours taken by the man to reach his 
destination. 


Figure 20 An annotated diagram of the man’s position and path 
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nA 


We now have to express T in terms of x. By Pythagoras’ theorem, the 
distance in kilometres that the man runs through the forest is 


y 12+ 27. 
The distance in kilometres that he runs along the road is simply 
2—2. 


For each of the two parts of the man’s run, the time that he takes is equal 
to the distance that he runs divided by the speed at which he runs. Hence 
the total time T that he takes to reach his destination is given by 


ee: 
8 16 ` 


To solve the problem, we now need to use calculus to find the value of x in 
the interval [0,2] that gives the minimum value of T. That is, we need to 
carry out the final step of the strategy. 


T= 


You could do that without using a computer, and you might like to try 
this, if you want a challenge. However, in the next activity you’re asked to 
do it by using a computer. This is quicker and easier, and it will give you 
practice in using a computer to solve problems like this. A by-hand 
solution is provided at the end of the solution to the activity. 


Activity 23 Solving an optimisation problem using a computer 


Solve the problem discussed above, by following the steps below. Carry out 
parts (a), (b), (d) and (e) using the module computer algebra system. 


(a) Define a function f as follows: 


VITZ 2- 
f(a) == T 


(b) Plot the graph of f, for values of x in the interval [0,2] at least. 


(c) Use your plot to estimate roughly the value of x in [0,2] that gives the 
minimum value of f(z). 


(d) Find the derivative f’ of f. 
(e) Solve the equation f'(x) = 0. 


(£) Hence state, to the nearest 10 metres, how far along the road the man 
should join it. 


3.3 Derivatives of inverse functions 


In this subsection you’ll meet a rule, known as the inverse function rule, 
which you can use to work out a formula for the derivative of an inverse 
function when you know a formula for the derivative of the original 
function. 


The inverse function rule can be derived from the chain rule, as follows. 
We’ll use Leibniz notation, since the inverse function rule is usually easier 
to work with in Leibniz notation. 


Suppose that y is an invertible function of x. Then x is also a function 
of y. As you know, the derivative of x with respect to x is 1, and we can 
write this fact as 


dr 
Ta 
However, by the chain rule, 
dx dz dy 
dr dy da’ 
Therefore 
de dy _ 
dy dx 
Hence, for values of x for which dx/dy exists and is non-zero, 
dy 1 
de de/dy 


So we have the following rule. 


Inverse function rule (Leibniz notation) 


If y is an invertible function of x, then 


dy _1_ 
dr dz/dy’ 
for all values of x such that a exists and is non-zero. 


dy 


You can see that, as with the chain rule, the Leibniz form of the inverse 
function rule looks like an obvious fact about fractions. As with the chain 
rule, you should keep in mind that that’s not what it is. 


Here’s an informal way to see that the inverse function rule makes sense. 
Suppose that y is an invertible function of x, which means that x is also a 
function of y. Suppose that, for a particular value of x and its 
corresponding value of y, the value of x is increasing at the rate of 2 units 
for every unit that y increases, as illustrated in Figure 21. Then, at these 
values of x and y, you’d expect the value of y to be increasing at the rate 
of 3 unit for every unit that x increases. This is what the inverse function 
rule tells you. 


(x,y) 


Sy 


Figure 21 A point (x,y) on the graph of a function 
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You'll see the inverse function rule in Lagrange notation later in this 
subsection. First let’s look at some examples of how you can use it. 


Using the inverse function rule 


In the next example, the inverse function rule in Leibniz notation is used 
to work out that the derivative of In x is 1/xz. This is a fact that you met 
in Subsection 1.2. 


Example 14 Using the inverse function rule 


Use the inverse function rule, and the derivative of the exponential 
function, to differentiate y = ln z. 


Solution 
@. Express x in terms of y. ® 
We have y = ln z, so 
r= er, 
@. Differentiate with respect to y. & 
Therefore 
doy 
dy 
@. Use the inverse function rule. & 
By the inverse function rule, 
yt 
dr ey 
@. Use the relationship between x and y to express the derivative in 
terms of x. © 
Hence, since x = e”, 
Co C 


dr x 


As illustrated in Example 14, when you use the inverse function rule, you 
obtain an expression for the derivative that’s expressed in terms of y 
rather than x. You then have to use the relationship between x and y to 
express the derivative in terms of x. 


In the next example, the inverse function rule is used to find the derivative 
of the function f(x) = sin~'(x). You saw in Unit 4 that this function has 
domain [—1,1]. Its graph is shown in Figure 22. 


Example 15 Using the inverse function rule again 


Use the inverse function rule, and the derivative of the sine function, 


to differentiate y = sin7! x. 


Solution 
@. Express x in terms of y. ® 
We have y = sin! z, so 
ap = Sa Y), 
@. Differentiate x with respect to y. ® 


Therefore 
2 os 
— = cosy. 
dy Y 


@. Use the inverse function rule. ® 
By the inverse function rule, 

dy 1 

dz cosy’ 
provided that cosy Æ 0. 


@. Use the relationship between x and y to express the derivative in 
terms of x. In this case the relationship is given by x = siny (or 
y = sin™! r), and you need to express cosy in terms of x. You could 


write cosy = cos(sin-! x), but you can obtain a simpler expression by 


using the identity sin? y + cos? y = 1, as follows. ® 
The identity cos? y + sin? y = 1 gives 


cosy = +4/ 1 — sin? y = +V 1 — 2?. 


The + sign applies here, because y takes values only in the interval 
[—1/2, 7/2] (since y = sin™t(zx)) and so cosy is always non-negative. 
Hence 


cosy = V1—2?. 
Therefore 
dy _ il 


dz yi 


provided that z # +1. 
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Figure 22 The graph of 


y=sin tg 
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Figure 23 The graph of 


y =cos tax 


Figure 24 The graph of 


y= tan“! g 
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Notice that the formula for the derivative of sin™t x found in Example 15 
doesn’t apply at the endpoints of the domain of this function, that is, 
when z = —1 and g = 1. This is because the function f(x) = sin! z isn’t 
right-differentiable at x = —1 and isn’t left-differentiable at x = 1, as you 
can see, roughly, from the shape of its graph, in Figure 22. The ‘one-sided 
tangents’ at these values of x are vertical. 


Here’s another point to note about Example 15. Strictly, it involves the 
sine function with its domain restricted to the interval [—7/2, 7/2], rather 
than the sine function with its usual domain, R. As you saw in Unit 4, the 
sine function with its usual domain doesn’t have an inverse function. 


In the next activity you’re asked to find the derivatives of the functions 
f(x) = cos™} z and f(x) = tan~! z. You saw in Unit 4 that these functions 
have domains [—1,1] and R, respectively. Their graphs are shown in 
Figures 23 and 24, respectively. 


Activity 24 Using the inverse function rule 


Use the inverse function rule, and the derivatives of the cosine and tangent 
functions, to differentiate the following functions. 


(a) y=cos-* x (b) y = tan™t z 
Hint: To express the derivatives in terms of x, use the identity 


sin? y + cos? y = 1 in part (a), and the identity tan? y + 1 = sec? y in 
part (b). 


The derivatives found in Example 15 and Activity 24 are repeated below, 
and can now be included in the list of derivatives of standard functions 
that you have available to work with. 


oi (Sines a) = l 
dx eS 
1 
=i 
Å S —— 
: (aos a) E 
1 
— (tan7! 7) = 
7 (taney ~ 2) aoe 


In the next activity you’re asked to combine these standard derivatives 
with some of the rules for differentiation that you’ve seen, to find the 
derivatives of some more functions. 


Activity 25 Finding derivatives of functions involving inverse 
trigonometric functions 


Differentiate the following functions. 


(a) f(x) =sin—1(3z) (b) f(z) =e" cos™t x 


The inverse function rule in Lagrange notation 
Let’s now translate the inverse function rule into Lagrange notation. To do 
this, we set y = f7 t(x). Then 


dy -ly\ 
t= (FY), 


where (f~1)’ denotes the derivative of f~', as you’d expect. 


Also, the equation y = f~'(a) is equivalent to the equation x = f(y), so 


This gives the following form of the inverse function rule. 


Inverse function rule (Lagrange notation) 


If f is a function with inverse function f~t, then 


—1y/ 5 1 
PO) = Fey’ 


for all values of x such that f’(f~'(a)) exists and is non-zero. 


This is a formula for the derivative of f~t in terms of the derivative of f, 
but it may look rather complicated and difficult to understand. Here’s a 
helpful way to understand what it tells you, and why it holds. 
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First, it’s important to realise the following fact. If two straight lines 
(drawn on axes with equal scales) are reflections of each other in the line 
y = x, then their gradients are reciprocals of each other. This is because if 
you consider any pair of points on one line, and their reflections on the 
other line, then the values of the run and the rise are interchanged for the 
two pairs of points, as illustrated in Figure 25(a). The lines in this 
diagram have gradients 2 and 5: 


Another fact, which you met in Unit 3 and will be useful here, is that if 
two points (plotted on axes with equal scales) are reflections of each other 
in the line y = x, then they’re obtained from each other by interchanging 
z- and y-coordinates. For example, Figure 25(b) shows the point (3,1) and 
its reflection (1,3) in the line y = z. 


Yr. 7 UA 7 
rise=2, A A 


64 run=1 7 


Figure 25 (a) Two straight lines that are reflections of each other in the 
line y = x (b) Two points that are reflections of each other in the line 
v=o 

Now remember that the graphs of a function and its inverse function are 
reflections of each other in the line y = x, when they’re drawn on axes with 
equal scales. For example, Figure 26 shows the graphs of a function f and 
its inverse function f~'. (The function is in fact f(x) = z? +2 (x > 0), 
with inverse function f~!(x) = ys — 2.) The two tangents shown are 
reflections of each other in the line y = x, so their gradients are reciprocals 
of each other. The gradient of the red tangent is the gradient of the graph 
of f7! at the point (3,1), which is (f~')’(3). The gradient of the blue 
tangent is the gradient of the graph of f at the point (1,3), which is f’(1). 
Hence 


=1y/¢3) = 


This is a particular instance of the inverse function rule, with z = 3 and 


f) = f3) = 1. 


Figure 26 The graphs of a function f and its inverse function f~t, and 
their tangents at (1,3) and (3,1), respectively 


In general, consider the graphs of any invertible function f and its inverse 
function f7}, as illustrated in Figure 27. Consider two (non-vertical) 
tangents that are reflections of each other in the line y = zv, as illustrated. 
Their gradients are reciprocals of each other. The gradient of the red 
tangent is the gradient of the graph of f~! at the point (x, f~!(2)), which 
is (ft) (x). The gradient of the blue tangent is the gradient of the graph 
of f at the point (f~!(x),x), which is f’(f~!(x)). Hence 
1 


-ly — 
PTO) = N, 


This is the inverse function rule. 


Figure 27 The graphs of a function f and its inverse function f~t, and 
their tangents at (f~'(x),x) and (x, f~!(x)), respectively 
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3.4 Table of standard derivatives 


You’ve now reached the end of the material that introduces differential 
calculus in this module, though you'll use the ideas and methods again in 
Unit 11, Taylor polynomials. In the next section, you’ll make a start on 
learning about integral calculus. To round off your introduction to 
differential calculus, here’s a table of the standard derivatives that you’ve 
met. You should try to memorise at least the first six (those above the line 
in the middle of the table), as they’re used frequently in mathematics. 
This table of standard derivatives is also included in the Handbook. 


Standard derivatives 
Function f(x) Derivative f'(x) 


a (constant) 0 


nee nar! 
e” e” 
1 
lng — 
T 
sin x COS & 
COs £ — sin x 
tan x sec? x 
cosec £ — cosec T cot T 
sec x sec x tan x 
COUT — cosec? x 
: 1 
sin™! x — 
y1 — x2 
= 1 
COSY — — 
y1 -— r2 
-i 1 
tank ~ ae 5 
1l+2 


The first calculus textbook 


In about 1690 the wealthy Marquis de l’H6pital (1661-1704), a minor 
French nobleman, became interested in calculus as a result of journal 
articles by Gottfried Wilhelm Leibniz and the Swiss mathematicians 
Jacob and Johann Bernoulli. However, these articles contained little 
by way of explanation and so l’Hôpital paid Johann Bernoulli, then 
living in Paris, to tutor him in the new subject. When Bernoulli 
moved from Paris to Groningen to take up a professorship, they came 
to an arrangement whereby for a considerable monthly sum Bernoulli 
would send |’H6pital material on calculus, including his new results, 
and would deny access to the material to anyone else. In effect, 
l'Hôpital bought the rights to Bernoulli’s discoveries. 


By 1696 l'Hôpital felt he understood enough of the mathematics to 
publish a book on it, Analyse des Infiniment Petits (Analysis of 
Infinitely Small Quantities). Although in the book l'Hôpital 
acknowledges his debt to Bernoulli, after |’Hopital’s death Bernoulli 
complained that Hopital had won undeserved praise for the text. 
But because l’ Hôpital was known to have been a competent 
mathematician and because Bernoulli was known to be a contentious 
egotist, the latter’s complaints were generally dismissed. However, 
documents discovered in the 20th century have revealed that 
Bernoulli was in fact right to claim much of the book for himself. 


4 Reversing differentiation 


In Unit 6 you learned that if you know the values taken by a continuously 
changing quantity throughout a period of change, then you can use 
differentiation to find the values taken by its rate of change throughout the 
same period. For example, you saw that if you have a formula for the 
displacement of a moving object in terms of time, then you can use 
differentiation to find a formula for its velocity in terms of time. You saw 
that the area of calculus that deals with this type of process is called 
differential calculus. 


In the rest of this unit, and in Unit 8, we’ll consider the opposite process. 
We'll consider situations where you know the values taken by the rate of 
change of a continuously changing quantity throughout a period of change, 
and you want to find the values taken by the quantity throughout the 
same period. For example, you might have a formula or a graph that tells 
you the values taken by the velocity of an object (such as the walking man 
discussed in Unit 6) over some time period, and you want to use this 
information to work out the values taken by the displacement of the object 
over the same time period. As mentioned earlier, the area of calculus that 
deals with this type of process is called integral calculus. 


4 Reversing differentiation 
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As you'll see, when youw’re working with integral calculus, it can be useful 
to consider whether a function is continuous. A continuous function is 
one whose graph has no discontinuities (that is, ‘breaks’) in it. In other 
words, informally, it’s a function whose graph you can draw without taking 
your pen off the paper (though you might need an infinitely large piece of 
paper, as the graph might be infinitely long!). For example, the function 
f(x) = 2z is a continuous function, as shown in Figure 28(a), but the 
function f(x) = 1/2? isn’t, as its graph has a break at x = 0, as shown in 
Figure 28(b). 


4.1 Antiderivatives and indefinite integrals 


In integral calculus you usually start with a function f, and you want to 
find another function, say F, whose derivative is f. Such a function F is 
called an antiderivative of the original function f. For example, an 
antiderivative of the function f(x) = 2x is the function F(x) = x”, because 
the derivative of x? is 2z. 


The process of finding an antiderivative of a function is called 
antidifferentiation, or, more commonly, integration. So integration is 
the reverse of differentiation. 


The first thing to realise is that a function can have more than one 
antiderivative. To see this, try the following activity. 


Activity 26 Differentiating functions whose formulas are the same 
apart from a constant term 


Differentiate the following functions. 
= — — 5 
(a) F(x) = x? (b) F(x) =27+3 (c) F(a) =2?-2 


You should have found that all three functions in Activity 26 have 
derivative F’(x) = 2x. In fact, you can see that any function of the form 


F(x) =a? +c, 


where c is a constant, has derivative F’(x) = 2x. This is because the 
derivative of a constant term is zero. So each function of the 
form F(x) = x? +c is an antiderivative of the function f(x) = 2z. 


Another way to think about the fact that all the functions of the form 
F(x) = x? +c have the same derivative is to remember that adding a 
constant to the formula of a function has the effect of translating its graph 
vertically. So the graphs of all the functions of the form F(x) = x? + c are 
vertical translations of each other, as illustrated in Figure 29(b). 
Translating a graph vertically doesn’t change its gradient at each x-value, 
of course. In other words, it doesn’t change the derivative of the function 
that the graph represents. So each of the functions in Figure 29(b) has the 
same derivative, namely f(x) = 2x, whose graph is shown in Figure 29(a). 


Another thing to appreciate about all the functions of the form 

F(x) = x? +c, where c is a constant, is that these functions are the only 
antiderivatives of the function f(x) = 2x. That’s because any 
antiderivative of f(x) = 2x has the same gradient at each z-value as the 
function F(x) = x”, and the functions of the form F(x) = x? + c are the 


only functions with this property. 
NL), 
i/ 2 


2 
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(b) F(z) =x? +c 


Figure 29 The graphs of (a) a function (b) some of its antiderivatives 


So the formula F(x) = z? + c describes the complete family of 
antiderivatives of the function f(x) = 2x. We call the general 

function F(x) = x? + c the indefinite integral of the function f(x) = 2z. 
The word ‘indefinite’ refers to the fact that the constant c can take any 
value. 


In general, consider any function f that has an antiderivative. You can see 
that you can add any constant that you like to the formula for the 
antiderivative, and you'll get the formula for another antiderivative of f. 
Equivalently, you can translate the graph of the antiderivative vertically by 
any amount, and you'll get the graph of another antiderivative of f. 


These antiderivatives are the only antiderivatives of the function f, 
provided that f is a continuous function. 


So, if f is a function that has an antiderivative, and f is continuous, then 
the general function 


F(x) = (formula for any particular antiderivative of f) + c, 


where c represents any constant, and whose domain is the same as the 
domain of f, describes the complete family of antiderivatives of f. We call 
this general function F the indefinite integral of the function f. 


The constant c in an indefinite integral is called an arbitrary constant, 
or the constant of integration. It’s usually denoted by c, but you can 
use any letter. Like the word ‘indefinite’ in ‘indefinite integral’, the word 
‘arbitrary’ in ‘arbitrary constant’ refers to the fact that the constant c can 
take any value. 
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You don’t need to be concerned about what happens when the function f 
isn’t continuous, because normally when we’re solving problems in integral 
calculus we work only with continuous functions. These are usually the 
only functions that we need for the sorts of calculations that we want to 
perform. However, if you’d like to know what the problem is with 
functions that aren’t continuous, then read the explanation at the end of 
this subsection, when you reach it. 


Before you go on, it’s important to make sure that you fully understand 
the difference between an antiderivative and an indefinite integral of a 
function. The definitions that you’ve seen are summarised below. 


Antiderivatives and indefinite integrals 
Suppose that f is a function. 


An antiderivative of f is any specific function whose derivative is f. 


If f has an antiderivative, and f is continuous, then the indefinite 
integral of f is the general function obtained by adding an arbitrary 
constant c to the formula for an antiderivative of f. It describes the 
complete family of antiderivatives of f. 


4 Reversing differentiation 


Activity 27 Understanding antiderivatives and indefinite integrals 


(a) Show that the function F(x) = 4 sin(2z) is an antiderivative of the 
function f(x) = cos(2z). 


(b) What is the indefinite integral of the function f(x) = cos(2x)? 


(c) Write down an antiderivative of the function f(x) = cos(2x) other 
than the antiderivative in part (a). 


Even though the idea of an indefinite integral really applies only to 
continuous functions, it’s convenient in practice to state indefinite integrals 
of other functions. For example, consider again the function f(x) = 1/2?, 


which isn’t continuous. Its graph is repeated in Figure 30. An ut 
antiderivative of this function is the function F(z) = —1/z, as you can 44 
check by differentiating this function F. Even though f isn’t continuous, | 
we do still say | 

the function f(x) = 1/x? has indefinite integral F(a) = —1/x + c. D T 
This is shorthand for —2+ 

any continuous function f with rule f(x) = 1/x? has indefinite =o 

integral F with rule F(x) = —-1/r+c. 


For example, the continuous function f(x) = 1/x? (x > 0), whose graph is 


shown in Figure 31, has indefinite integral F(x) = —1/x +c (x > 0). . 
Figure 30 The graph of the 


We use this shorthand for any function that isn’t continuous. function f(x) = 1/2? 


As you’ve seen, to find an indefinite integral of a function, you just find a 
formula for any antiderivative of the function, and add an arbitrary 
constant, usually c. 


You can find antiderivatives of many functions by using what you already 

know about derivatives. In the next two subsections you’ll practise finding 
antiderivatives, and hence indefinite integrals, of some simple functions in 

this way. Before you make a start on that, here are a few things that you 

need to know. 


First, note that when we’re discussing antiderivatives and indefinite 
integrals, we use conventions similar to those for derivatives. For example, 
the terms ‘antiderivative’ and ‘indefinite integral’ can be applied directly 


to expressions that could appear on the right-hand side of the rule of a =e] 
function, as well as to the function itself. For instance, rather than saying 
that 
the indefinite integral of f(x) = 2x is F(x) = 27 +c, Figure 31 The graph of the 


A Bey 
we can simply say that function f(x) = 1/x° (x > 0) 


the indefinite integral of 2x is x? + c. 


Also, as you’d expect, you can use any letters for functions and variables — 
you're not restricted to the standard ones, f and z. 
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When we’re working with antiderivatives and indefinite integrals in the 
rest of this unit, we’ll usually use a lower-case letter, such as f, to denote 
the function that we start with, and the corresponding upper-case letter, 
such as F, to denote any of its antiderivatives, or its indefinite integral. 
You'll meet a more adaptable, though more complicated, notation for 
indefinite integrals in Unit 8. It involves the integral sign, f , which you 
may have seen elsewhere. 


Functions that aren’t continuous 


If you’d like to know what the problem is with functions that aren’t 
continuous, then read the discussion below. Otherwise just go on to 
Subsection 4.2. 


A function f that has an antiderivative but which isn’t continuous may 
have other antiderivatives as well as those obtained by vertically 
translating the graph of the original antiderivative. For example, consider 
the function f(x) = 1/x?, whose graph is shown in Figure 32(a). One 
antiderivative of this function f is the function F(x) = —1/z, as mentioned 
above. The graph of this function F is shown in Figure 32(b). Every 
function obtained by vertically translating this graph is an antiderivative 
of f, but so is every function obtained by vertically translating each of the 
two pieces of the graph individually, since that doesn’t change the gradient 
at each x-value. For example, Figure 32(c) shows an antiderivative of f 
obtained by translating the left-hand piece of the graph of F(x) = —1/x 
down by 2 units, and the right-hand piece up by 1 unit. 


yt y y 
4- i 4 
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eee 1 ae = 
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Figure 32 The graphs of (a) f(x) = 1/2? (b) the antiderivative 
F(x) = —1/x of f(x) = 1/x? (c) another antiderivative of f(x) = 1/2? 


So if f is a function that has an antiderivative, but which isn’t continuous, 
then the general function 


F(x) = (formula for any particular antiderivative of f) + c, 


where c represents any constant, and whose domain is the same as the 
domain of f, usually doesn’t describe the complete family of antiderivatives 
of f, as stated earlier. As mentioned earlier, this isn’t a problem, because 
normally when we’re solving problems in integral calculus we work only 
with continuous functions. 


4.2 Antiderivatives of power functions 


In this section you'll see how to find antiderivatives, and hence indefinite 
integrals, of power functions. 


You can find antiderivatives of power functions by using what you already 
know about derivatives of power functions. For example, suppose that you 
want to find an antiderivative of the power function f(x) = xf. Consider 
what happens when you differentiate the power function whose exponent 


is 1 more than 6, namely g(x) = x": 
d; 6 
F (x ) =a 


This equation tells you that x” is ‘nearly’ an antiderivative of xê. You can 
deduce from it that if you multiply x’ by the constant z, then 
differentiating will give the answer that you want, 2°: 


d /1 1 
Eze) =7 x 775 = zê. 


So an antiderivative of f(z) = 2° is F(x) = 72". Hence the indefinite 


integral of f(x) = 2° is 


ae Wit 
F(x) = 52" +c. 
You can find a general formula for an antiderivative of a power function by 
applying the thinking above to the general power function f(x) = x”. 
Differentiating the power function g(x) = x"*! gives 


2 (°t!) = (n+1)2”. 
So if you multiply z”*! by 1/(n + 1), then differentiating will give the 


answer that you want, x”: 


d 1 n+1 1 
— = 1a” = 2”. 
tle ap es je =g 


This equation gives a general formula for an antiderivative, and hence the 
indefinite integral, of a power function, which is stated below. The formula 
applies for every value of the exponent n, except n = —1. It doesn’t apply 
for n = —1 because division by zero is undefined. (The case n = —1 is 
more complicated, and is dealt with in Subsection 6.1.) 


Indefinite integral of a power function 


For any number n except n = —1, 


il 
the indefinite integral of x” is i TLE, 
n 


Here’s a helpful informal way to remember this formula. 
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Indefinite integral of a power function (informal) 


To find an antiderivative of a power function (except the power 
function f(x) = ~'), first increase the power by 1, then divide by the 
new power. 


As usual, to obtain the indefinite integral, add the arbitrary 
constant c. 


Example 17 Finding indefinite integrals of power functions 


Find the indefinite integrals of the following functions. 


(a) f(x)=2 (b) ()= (9) Ala) = vz 


Solution 


(a) ®. To find an antiderivative, increase the power by 1, then divide 
by the new power. To obtain the indefinite integral, add the 
arbitrary constant c. ® 


The indefinite integral is 
il 
F(z) = —r +c. 
(2) I“ FE 


(b) & First write the function in the form g(x) = z”. & 
The function is g(x) = x~°. 
@. Proceed as in part (a). @ 


The indefinite integral is 


@. Simplify the answer. ® 


ape 
SS 


(c) @& First write the function in the form y = z”. @ 
The function is h(x) = z". 
@. Proceed as in part (a). @ 
The indefinite integral is 


1 
H(z) = ae ar © 


Activity 28 Finding indefinite integrals of power functions 


Find the indefinite integrals of the following functions. 


(a) fz) =2% b pw=— (©) F(z) = 2% 


= 


(d) f= Ye (e) g= (£) Ww) = 


Activity 29 Finding different antiderivatives of a function 


Let f be the function with rule f(x) = 2°. 
(a) Find the indefinite integral of f. 


(b) Hence write down three different antiderivatives of f. 


Notice that the formula for the indefinite integral of a power function tells 
you that the indefinite integral of the function 


f(x) =1. (which is the same as f(x) = z?) 
is 
Fa= 76 
(As in earlier units, we assume that 0° = 1, so the formula works when 


x =0.) Another way to obtain the indefinite integral above is, of course, to 
use the fact that the derivative of x is 1. 


In Subsection 6.1 later in the unit, you’ll see a formula for the indefinite 
integral of the only power function that isn’t covered by the general 
formula that you’ve met in this subsection. This is the power function 
f(x) = x7}, that is, the reciprocal function f(x) = 1/2. 
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4.3 Constant multiple rule and sum rule for 
antiderivatives 


Just as with derivatives, once you have formulas for antiderivatives of some 
functions, you can combine them to obtain formulas for antiderivatives of 
further functions. 


Unfortunately, though, it’s generally more tricky to combine 
antiderivatives than it is to combine derivatives. For example, there’s no 
straightforward rule that you can use to find an antiderivative of a product 
of two functions if you already know an antiderivative of each of the two 
individual functions. In other words, there’s no ‘product rule for 
antiderivatives’. Similarly, there’s no ‘quotient rule for antiderivatives’ and 
no ‘chain rule for antiderivatives’. 


However, there is a constant multiple rule and a sum rule for 
antiderivatives, and they work in the same simple ways as the similar rules 
for derivatives, as follows. 


Constant multiple rule for antiderivatives 


If F(x) is an antiderivative of f(x), and k is a constant, then kF (x) is 
an antiderivative of kf (x). 


Sum rule for antiderivatives 


If F(x) and G(x) are antiderivatives of f(x) and g(x), respectively, 
then F(x) + G(x) is an antiderivative of f(x) + g(x). 


(You might have noticed that the letter used for the constant in the 
statement above of the constant multiple rule for antiderivatives, namely k, 
is different from the letter used in Unit 6 for the constant in the constant 
multiple rule for derivatives, which was a. It doesn’t matter which letter is 
used, of course, but in integral calculus it’s sometimes convenient to 
reserve the letter a to denote something else, as you'll see in Unit 8.) 


These rules follow directly from the similar rules for derivatives. To see 
why, first suppose that F(x) is an antiderivative of f(x). Then 


d 

£ (Fle) = f(a). 
It follows from the constant multiple rule for derivatives that, for any 
constant k, 


d 
qz EE) = kf (2). 


This equation tells you that kF (x) is an antiderivative of k f(x), which is 
what the constant multiple rule for antiderivatives says. 
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Now suppose that F(x) and G(x) are antiderivatives of f(x) and g(x), 
respectively. Then 


d d 

ae (F(x)) = f(a) and T (G(x)) = g(x). 
It follows from the sum rule for derivatives that 

d 

PT (F(x) + G(x)) = f(x) + g(x). 


This equation tells you that F(x) + G(x) is an antiderivative of 
f(x) + g(x), which is what the sum rule for antiderivatives says. 


Remember that, as always when a constant multiple rule and a sum rule 
hold, the sum rule also holds if you replace the plus signs by minus signs. 
That is, if F(x) and G(x) are antiderivatives of f(x) and g(x), respectively, 
then F(x) — G(x) is an antiderivative of f(x) — g(x). 


Here’s an example of how you can use the two rules above. 


== 
Example 18 Using the constant multiple rule and the sum rule for A 
antiderivatives — 
Find the indefinite integrals of the following functions. 


(a) f(e)=10e4 (b) g(t) = $ -SvE+1 


Solution 


(a) @ Find an antiderivative by using the constant multiple rule, 
together with the rule for finding an antiderivative of a power 
function. To obtain the indefinite integral, add the arbitrary 
constant c. & 


The indefinite integral is 
F(x) =10x $a° +c 
=O ee 
(b) ®. First express the function as a sum of constant multiples of 
power functions. & 
The function is 
g(t) = 6t? — 5t/? +1. 
@. By the sum rule, you can find an antiderivative by working 
term by term. Use the method of part (a) to find an 


antiderivative of each term. To obtain the indefinite integral, add 
the arbitrary constant c. ® 
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Notice in particular that it follows from the constant multiple rule for 
antiderivatives that, for any constant a, the indefinite integral of the 
function 


f(x) =a_ (which is the same as f(x) =a x 1) 


F(x) =az +c. 


Another way to see this is, of course, to use the fact that the derivative of 
az is a. 


This useful fact is summarised below. 


Indefinite integral of a constant function 


The indefinite integral of the constant a is ax + c. 


You can practise using the constant multiple rule and the sum rule for 
antiderivatives in the activity below. In parts (d)-(g), you have to begin 
by using an algebraic technique, such as multiplying out brackets or 
expanding a fraction, to express the function as a sum of constant 
multiples of power functions. 


Remember that you must include an arbitrary constant c in every 
indefinite integral that you find. 


Activity 30 Using the constant multiple rule and the sum rule for 
antiderivatives 


Find the indefinite integrals of the following functions. 
(a) f(z) =8z? (b) f(z)=7 (c) fe) =Va-2° 
(d) f(z) =(e@+2)(e—5) (e) g(t) = (t+ 1)? 


r? 
Ow W roho- 


T 


You'll meet some more ways to combine antiderivatives in Unit 8. 


5 Particular antiderivatives 


5 Particular antiderivatives 


Sometimes when you’re working with a function, you need to find a 
particular antiderivative that it has, rather than any antiderivative, or its 
indefinite integral. For example, this is often what you need to do when 
the function is a mathematical model of a real-life situation. In this 
section, you'll learn how to find such particular antiderivatives, and see 
some examples of how you can use them. 


5.1 Finding particular antiderivatives 


You can work out which of the infinitely many antiderivatives of a function 
is the right one if you have some appropriate extra information. Usually 
the extra information that you have is the value taken by the 
antiderivative at some value of the input variable. Here’s an example. 


Here are some similar examples for you to try. 
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Activity 31 Finding particular antiderivatives 


(a) Find the antiderivative F of the function f(x) = x + 2 such that 
F(1) = 3. 


(b) Find the antiderivative F of the function 


f(z) =1/a* (s € (0,00)) 
such that F(3) = —$. 


5.2 Using integration to work with rates of change 


As mentioned at the start of this section, it’s often useful to find a 
particular antiderivative when you’re working with a function that models 
a real-life situation. This is illustrated in the example below. 


In this example, we’ll consider again a situation that you met in Unit 6, 
namely the one where a man walks along a straight path, gradually 
slowing down as he walks. In Section 3.1 of Unit 6, we started with the 
equation for the man’s displacement in terms of time, and we used 
differentiation to find an equation for his velocity (his rate of change of 
displacement) in terms of time. Here we’ll do the opposite: we’ll start with 
the equation for his velocity in terms of time, and we’ll use integration to 
find an equation for his displacement in terms of time. Remember that 
integration is the opposite process to differentiation — it’s the process of 
finding an antiderivative of a function. 
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Solution 
(a) @ Write down the given equation for the man’s velocity. ©@ 
The man’s velocity v at time t is given by 
ü= = Št. 
@. Use integration to find an equation for his displacement. It 


will contain an arbitrary constant, because at this stage we don’t 
know which antiderivative is the right one. ® 


Hence his displacement s at time t is given by 
s=6t- Š x }ť +c, 
that is, 
s = 6t — at? ar Gy 
where c is a constant. 
@. Use extra information to find the value of the constant c. ©& 


When the man begins his walk, his displacement from his starting 
point is 0km. That is, when t = 0, s = 0. Substituting these 
values into the equation above gives 


0=6x0-2xl+te, 
that is, 
e= 0. 
Hence the required equation for s in terms of t is 
s=6t— 20. 
(b) When ¢t = 2, 
s=6x2—-2x2=12-3=2=95. 


So the man’s displacement two hours after he began his walk is 
9.5 km. 


The calculation in Example 20 is the sort of calculation that integral 
calculus allows you to carry out: if you know the values taken by the rate 
of change of a quantity throughout a period of change, then you can use 
integration to find the values taken by the quantity throughout the same 
period. 


Here’s something else that’s illustrated by Example 20. If the only 
information that you have about the motion of an object is an equation for 
its velocity in terms of time, then you don’t have enough information to 
find an equation for its displacement in terms of time. The best that you 
can do is to find an equation for its displacement in terms of time that 
contains an arbitrary constant. To find a definite equation (that is, to 


5 Particular antiderivatives 
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choose the correct function from a family of functions whose graphs are 
vertical translations of each other), you need to use further information, 
which is usually a fact about the displacement of the object at some 
moment in time. 


This is just what you’d expect: if all you know is an equation for the 
velocity of the object in terms of time, then you don’t have enough 
information to determine its displacement in terms of time, because it 
could have started anywhere! 


In this sort of situation you do, however, have enough information to find 
the change in the displacement of the object between any two points in 
time. You’ll see some examples of this later in this subsection. 


Here’s an example similar to Example 20 for you to try. 


Activity 32 Using integration to deduce displacement from velocity 


Suppose that a marble is rolled in a straight line down a long slope, and its 
velocity v (in metres per second) at time ¢ (in seconds) after it begins 
rolling is given by the equation 

v= iit. 
The marble starts rolling from a position that is 0.3 metres down the slope. 
Let s be its displacement in metres from the top of the slope. 
(a) Find an equation that expresses s in terms of t. 


(Hint: First find an equation for s in terms of t that contains an 
arbitrary constant, then use the value of s when t = 0.) 


(b) Hence find the displacement of the marble from the top of the slope 
4 seconds after it begins rolling. 


Let’s now consider another example of motion in a straight line from 
Unit 6. You saw that if an object falls from rest under the influence of 
gravity, and the effects of air resistance are negligible, then its 
displacement s (in metres) at time t (in seconds) after it began falling is 
modelled by the equation 


s = —4.9t?. 


Here the displacement of the object is measured from the point from which 
it starts to fall, and the positive direction along the line of motion is 
upwards. 


You saw that if you differentiate this equation, to obtain an equation for 
the object’s velocity in terms of time, and then differentiate again to 
obtain an equation for its acceleration in terms of time, then you obtain 
the fact that the object is moving with a constant acceleration of 
—9.8ms7?. The magnitude of this acceleration, 9.8 m s7?, is known as the 
acceleration due to gravity. 


5 Particular antiderivatives 


In the next example, we’ll carry out this process in reverse. We’ll start by 
assuming that the effect of gravity on any object falling from rest is to 
produce a constant acceleration of —9.8ms~?. We’ll use this fact to write 
down an equation for the object’s acceleration in terms of time, integrate it 
to obtain an equation for the object’s velocity in terms of time, and then 
integrate again to obtain an equation for the object’s displacement in 
terms of time. You'll see that the result is the equation stated above. 


Example 21 Using integration to deduce displacement from A 
acceleration nn 
Assume that an object falling from rest has a constant acceleration of 
—9.8ms~?, where the positive direction is upwards. 


Let s be the displacement in metres of the object from its initial 
position and let t be the time in seconds since it began falling. 


Find an equation that expresses s in terms of t. 
Solution 
@. Define the variables that you intend to use. © 


Let v be the velocity of the object in ms! and let a be its 


acceleration in ms~2. 


@. Write down the equation for the acceleration of the object. & 
The equation for a in terms of t is 
@ = =O. 


@. The variable t doesn’t appear on the right-hand side of this 
equation because the acceleration is constant. ©& 


@. Use integration to find an equation for v in terms of t, containing 
an arbitrary constant. © 


Hence the equation for v in terms of t is 
v = —9.8t + C, 
where c is a constant. 
@. Use extra information to find the value of the constant c. ® 


The object falls from rest, so when it begins falling its velocity is 
Oms~!. That is, when t = 0, v = 0. Substituting these values into the 
equation above gives 


0=-9.8xO0-+c, thatis, c=0. 
So the equation for v in terms of t is 


v = —9.8t. 
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@. Use integration again to find an equation for s in terms of t, 
containing another arbitrary constant. Don’t use the letter c for the 
arbitrary constant, as it’s been used already. Choose a different letter, 
such as b. @ 


It follows that the equation for s in terms of t is 
s = —9.8 x 50? +), 
that is, 
s = —4.927 + b, 
where b is a constant. 
@. Use extra information to find the value of the constant b. © 


When the object begins falling, its displacement is 0m. That is, when 
t = 0, s = 0. Substituting these values into the equation gives 


0=-49x07+b, thatis, b=0. 
So the equation for s in terms of t is 
s = —4.92?. 


This is the required equation for the displacement of the object in 
terms of time. 


The next activity is about the motion of a ball thrown vertically into the 
air. The ball is an example of a vertical projectile. A projectile is an 
object that’s launched into the air by a force that ceases after launch, and 
a vertical projectile is a projectile that’s launched vertically upwards. 


Provided that the effects of air resistance are negligible, a vertical 
projectile has a constant downwards acceleration due to gravity of about 
9.8ms~? at all times after its launch, just as an object falling from rest 
has. The only thing that makes its motion different from that of an object 
falling from rest is that at the start of its motion its velocity is not zero. In 
the next activity you’re asked to use a process similar to that in 

Example 21 to find a formula for the displacement of a ball thrown 
vertically into the air, in terms of time. 


Activity 33 Using integration to deduce displacement from 
acceleration 


Suppose that a ball is thrown vertically upwards with initial speed 
12ms~!. Assume that its subsequent motion is modelled as having a 
constant acceleration of —9.8ms~?, where the positive direction along the 
line of motion is upwards. 


5 Particular antiderivatives 


Let the acceleration, velocity and displacement of the ball at time ¢ (in 

seconds) after it was thrown be a (in ms”), v (in ms~!) and s (in m), 

respectively, where displacement is measured from the point from which 
the ball was thrown. 


(a) Find an equation that expresses v in terms of t. 
(b) Hence find an equation that expresses s in terms of t. 


(c) Use the formulas that you found in parts (a) and (b) to find the 
velocity and the displacement of the ball 1 second after it was thrown. 

(d) Use the formula that you found in part (b) to determine how long it 
takes for the ball to fall back to the point from which it was thrown. 


Hint: At the time when the ball has fallen back to this point, what is 
the value of s? 


The next activity involves an economic model of a type that you met in 
Unit 6. You saw there that marginal cost is the derivative of total cost. 


Activity 34 Using integration to work with marginal cost and total 
cost 


A company that produces agricultural fertiliser has developed a model for 
its production costs. According to the model, if the amount of fertiliser 
that the company is currently producing each week is q (in tonnes), then 
the marginal weekly cost m (in £ per tonne) of producing extra fertiliser is 
given by the equation 


m = 250 — 0.5. 
The model applies for values of q between 50 and 200. 


Currently the company produces 80 tonnes of fertiliser each week, at a 
total weekly cost of £30400. 


Let the total weekly cost of producing q tonnes of fertiliser be t (in £). 


(a) By using the fact that marginal cost is the derivative of total cost, find 
a formula for t in terms of q. 


(b) Use the formula that you found in part (a) to find the total weekly 
cost of producing 160 tonnes of fertiliser. 


(c) What is the cost per tonne (in other words, the unit cost, not the 
marginal cost) of producing the fertiliser if the company produces 
80 tonnes of fertiliser each week? What is it if it produces 160 tonnes 
each week? 
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5.3 Changes in the values of antiderivatives 


There’s a fact about the antiderivatives of a continuous function that it’s 
important to appreciate. 


To understand this fact, consider again the function f(x) = 2x, and its 
antiderivatives, the functions of the form F(x) = z? + c, whose graphs are 
shown in Figure 33. 


Figure 33 The graphs of a particular continuous function and some of its 
antiderivatives 


Let’s choose any two numbers in the domain of the function f, say 1 

and 2, as shown in Figure 34(a), and consider the amounts by which the 
antiderivatives of f change as x changes from the first number to the 
second number. Figure 34(b) illustrates the amount of the change for each 
of two different antiderivatives of f. You can see that the amount of the 
change appears to be 3 units in each case. 


The important fact to appreciate is that the amount of the change is the 
same for all the antiderivatives of f. That’s because the graphs of all the 
antiderivatives are vertical translations of each other, and translating a 
graph vertically doesn’t affect how much it changes from one value of x to 
another. 


Another way to see that all the antiderivatives of the function f(x) = 2x 
change by the same amount as x changes from x = 1 to x = 2 is to use 
algebra. Consider any antiderivative F of f. Then F is of the form 
F(x) = z? + c, where c is some constant. The value of F when x = 1 is 


FQ) = +e, 
and the value of F when x = 2 is 


F(2)=2? +c. 


So the change in the value of F as x changes from x = 1 to x = 2 is 
F(2) — F(1) = (27 +e) — (1° +0) 
=4+c—-l-e 
= 3. 


Amount of 
change 
= 3 units 


Amount of 
change 
= 3 units 


Figure 34 (a) The numbers 1 and 2 in the domain of f(x) = 2x (b) The 
amounts that two different antiderivatives of f(x) = 2x change as x 
changes from 1 to 2 


You can see that no matter what the value of c is, the two occurrences of it 
will cancel out when you calculate F'(2) — F(1). Hence you'll always get 
the same answer for F'(2) — F(1), namely 3. 


In general, we have the following important fact. 


Suppose that f is a function that has an antiderivative, and f is 
continuous. If a and b are numbers in the domain of f, then all the 
antiderivatives of f change by the same amount as x changes from 
x =a to x =b. In other words, the value of F(b) — F(a) is the same 
for every antiderivative F of f. 


So if you want to work out the amount by which an antiderivative of a 
continuous function f changes from x = a to x = b, then you just need to 
find any antiderivative F of f, and calculate F(b) — F(a). 


You can use this method to solve problems in which you know the values 
taken by the rate of change of a quantity over some period of change, and 
you want to find the amount by which the quantity changes from one point 
in this period to another. Here’s an example. 


5 Particular antiderivatives 
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Example 22 Finding a change in the value of an antiderivative 


Suppose that the rate of change of a quantity is given by the function 
a= ta” (x > 0), as shown in Figure 35. By what amount does the 
quantity change from x = 3 to x = 6? 


Solution 
@. Find any antiderivative F of f and calculate F(6) — F(3). & 


An antiderivative of f is 


Ea) t x ir’ = me 


So the change in the quantity from x = 3 to x = 6 is 


— iz? (x >0 
y= 52" (x20) F(6) — F() = (& x 6) — (x 3°) 


= 1) 
=p p 
Y = = 12.6. 
3 
2 
| 
1 | | Here’s a similar activity for you to try. 
Coy ea, m E l TE 
Activity 35 Finding a change in the value of an antiderivative 
Figure 36 The graph of Suppose that the rate of change of a quantity is given by the function 
f(z) = e7/8 (x > 0) f(x) = e*/8 (x > 0), as shown in Figure 36. Find the amount by which the 


quantity changes from x = 2 to x = 5, to three significant figures. 


In the next example and activity, the changing quantity is the 
displacement of a moving object, and hence the rate of change of the 
quantity is the velocity of the object. 


The example is about a car whose velocity is increasing, as shown in 
Figure 37. 


Example 23 Finding a change in displacement 


Suppose that a car begins to move along a straight road, and its 
velocity v (in ms~‘) during the first five seconds of its journey is 
given by v = f(t), where 


i Be ae = 
time, t (seconds) eee 
Here t is the time in seconds since the car began moving. How far 
does the car travel from the end of the fourth second to the end of the 
Figure 37 The graph of fifth second? 
v =3t (01s 5) 
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5 Particular antiderivatives 


The activity below is about an object whose velocity is decreasing, and 
then becomes negative, as shown in Figure 38; that is, it starts to move in 


the opposite direction, as shown in Figure 39. m j f 
acl 
# 3 
— = — — 
® g ® a a 24 
ae Pe yea en eR enna a ey ee E ay om eee 5 
position position position 3 : 
at time at time at time D N3 6 z 
= >” —]- : 
t=6 t=0 t=2 5 timd, t (seconds) 
| en 
2er | 
Figure 39 The object whose motion is given by Figure 38 is moving in =e , 
the positive direction at time t = 0, then turns at time t = 2 to move in D 
the negative direction g4 


Activity 30 Finding changes in displacement Fiewsg? “The maphof 


=3— 3 
Suppose that an object moves along a straight line, and its velocity v (in PGES 1S n) 


ms‘) at time t (in seconds) is given by v = f(t), where f(t) = 3— 3t 
(0 <t <6), as shown in Figure 38. 


(a) By what amount does the displacement of the object change from time 
t = 0 to time t = 1? What is the distance that the object travels in 
that time? 


(b) Repeat part (a) for the times t = 4 and t = 5. 


The ideas that you’ve met in this subsection will be important in Unit 8. 
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6 Antiderivatives of further standard 
functions 


In this section you'll meet and use formulas for antiderivatives of some 
further standard functions. These formulas are all found by using formulas 
for derivatives that you met earlier in this unit. 


6.1 An antiderivative of the reciprocal function 


We’ll begin by finding a formula for an antiderivative of the reciprocal 
function f(x) = 1/x, which is the only power function that isn’t covered by 
the general formula for the indefinite integral of a power function that you 
met in Subsection 4.2. 


Here’s how you can find an antiderivative of this function, by using what 
you already know about differentiation. 


Remember that you saw earlier in this unit that 


d 1 

T (Inv) = = (1) 
Unfortunately this formula doesn’t immediately give you a formula for an 
antiderivative of the function f(x) = 1/x, because the function f(x) = 1/x 
has domain (—oo, 0) U (0,00), but the formula applies only when z is 
positive (because ln x is defined only when g is positive). However, the 
formula does tell you that an antiderivative of the function 


g(x) == (x >0) 


G(z) = Ine. 


This fact is illustrated in Figure 40. The graph of g(x) = 1/x (x > 0), in 
Figure 40(a), gives the gradients of the graph of G(x) = Ing, in 
Figure 40(b). 


Figure 40 The graphs of (a) g(x) = 1/x (x > 0) and (b) G(x) = ln z 
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6 Antiderivatives of further standard functions 


To deal with the ‘other half’ of the function f(x) = 1/x, you can use the 
fact that equation (1), together with the chain rule, gives 


This formula applies for negative values of x, because In(—2) is defined 
when —z is positive, that is, when x is negative. So it tells you that an 
antiderivative of the function 


h(x)=1/x (<0) 


A(x) = In(-z). 
This fact is illustrated in Figure 41. The graph of h(x) = 1/x (x < 0), in 
Figure 41(a), gives the gradients of the graph of H(x) = In(—z), in 
Figure 41(b). 


(a) 


Figure 41 The graphs of (a) h(x) = 1/x (x < 0) and (b) A(x) = In(—z) 


You can obtain an antiderivative for the function f(x) = 1/z, with its 
whole domain (—co,0) U (0,00), by putting together the antiderivatives 
found for the two ‘halves’ of f. The graph of this antiderivative is shown in 
Figure 42(b). The graph of f(x) = 1/z is shown in Figure 42(a): it gives 
the gradients of the graph in Figure 42(b). 


Figure 42 The graphs of (a) f(x) = 1/x and (b) F(a) = Inz and 
F(x) = In(-z) 
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The formula for the antiderivative of f(x) = 1/x found above can be 
written as 


In(—z) (x <0) 
a) { lng (x > 0). (2) 


However, there’s a more concise way to write it, using the notation | | for 
the modulus of a number. Notice that if x is a negative number, then 


=z = |z| 
(for example, —(—3) = 3 = | — 3|). Also, if x is a positive number, then 
z= |r] 
(for example, 3 = |3|). So formula (2) can be written concisely as 
F(z) = ln |e. 
Hence the indefinite integral of f(x) = 1/z is 
F(x) =In|z|+ c. 
This formula is sometimes useful, but often when you’re working with the 
function f(a) = 1/2, the variable x takes only positive values. If this is the 


case, then you can just use the antiderivative of the function f(x) = 1/x 
(x > 0), which has the simpler formula 


F(x) =lng +c. 


The facts that you’ve seen so far in this section are summarised below. 


Indefinite integral of the reciprocal function 


1 
The indefinite integral of — is In |x| + c. 
T 


iL 
(If x takes only positive values, then the indefinite integral of — is 
x 


simply ln x + c.) 


In the next activity you’re asked to use this formula, together with the 
constant multiple rule and the sum rule for antiderivatives, to find the 
indefinite integrals of some further functions. 


Activity 37 Finding indefinite integrals of more functions 


Find the indefinite integrals of the following functions. 


@) fe) =% >) f@=" Oes (>) 


6 Antiderivatives of further standard functions 


6.2 Table of standard indefinite integrals 


You can use all of the derivatives of standard functions that you met in 
Unit 6 and in the first three sections of this unit to deduce antiderivatives, 
and hence indefinite integrals, of further functions. 


You saw in the last subsection that it isn’t straightforward to use the fact 

that the derivative of In x is 1/x to deduce an antiderivative of 1/x, because 
the domain of the function f(x) = 1/x contains numbers that aren’t in the 
domain of the function F(x) = ln x. Luckily there are no similar problems 

with any of the other derivatives of standard functions that you’ve met, so 
it’s straightforward to use them to deduce antiderivatives. 


For example, you can immediately deduce antiderivatives of the standard 
functions sin, cos and exp, as follows. Since 
the derivative of sin x is cos x, 
it follows that 
an antiderivative of cos x is sina. 
Similarly, since 
the derivative of cos x is — sin x, 
it follows that 
an antiderivative of — sin x is cos z, 
and hence (by the constant multiple rule for antiderivatives) 
an antiderivative of sin x is — cosa. 
Finally, since the derivative of e” is e”, it follows that an antiderivative of 
e” is e”. 


The table in the box below contains all the indefinite integrals of standard 
functions that you’ve met so far, and all the further indefinite integrals 
that can be deduced from the table of standard derivatives in 

Subsection 3.4, in the way just described. You should try to memorise at 
least the ones above the line in the middle of the table. For the others, you 
should try to make sure that you can at least recognise the functions on 
the left as functions that can be integrated easily, even if you continue to 
refer to the table for their indefinite integrals. This table of standard 
indefinite integrals is also included in the Handbook. 


You should 
integrate more! 


vve tried, but it doesw't 
make any difference! 
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Function Indefinite integral 
a (constant) ay E 
1 
xr” (n Æ-—1 pi g 
C ae ai =f 
il 
— In|z| +c 
96 
or Inz +c, for x > 0 
e7 e” +c 
sin x OEE 
cos £ sing +c 
sec? x tanz+c 
cosec? x = COB E 
sec x tan x SCG Wai 
cosec x cot © — cosec x +c 
1 aia 
— sin -z +c 
Ji pa 
l-g or — cos! r Fc 
: tan" x + 
— an -x+c 
14+ 2? 


In the next activity you’re asked to use these formulas, together with the 
constant multiple rule and the sum rule for antiderivatives, to find the 
indefinite integrals of some further functions. In some of the parts you 
have to start by using an algebraic technique to express the function in a 
form that you can integrate. 
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Activity 38 Finding indefinite integrals of more functions 


Find the indefinite integrals of the following functions. 


(a) h(0) = 3 cos 0 + 4sin 8 (b) g(¢) = 6 — 5 cosec? ¢ w) Fas T 
Oss; (f= e>) (H fe) = 
1 1 1 


V4 —4Ax? 
x—2 _#-2 


G) a) = 5(a@—3)2e—-1) (k) f(x) = (0) glz) = 3 


(m) f(x) = 8e” (n) f(e) = e+" (0) r(¢) = —cosec dcot o 
(p) r(0) = sec 8(sec 6 + tan 0) 


You'll learn much more about integration in Unit 8. In particular you’ll 
meet a surprising and useful alternative way to understand it. You’ll also 
meet some further methods for combining antiderivatives of functions to 
obtain antiderivatives of more functions. 


Learning outcomes 


After studying this unit, you should be able to: 
e remember the formulas for the derivatives of some standard functions 


e differentiate a wide variety of functions, including products, quotients 
and composites 


e use differentiation to solve simple optimisation problems 
e find and use derivatives using a computer 


e understand and use the relationship between the derivative of an 
invertible function and the derivative of its inverse function 


e understand that integration reverses differentiation 


e understand what’s meant by an antiderivative and the indefinite 
integral of a function 


e remember the formulas for the indefinite integrals of some standard 
functions 


e use the constant multiple rule and the sum rule for antiderivatives 


e use integration to work with the relationship between a changing 
quantity and its rate of change 


e in particular, use integration to work with the relationships between 
displacement, velocity and acceleration. 
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Solutions to activities 


Solution to Activity 1 


The graph of the derivative of the sine function 
for « between 0 and 27 must look something like the 
sketch below. 


Sv 


(You can work this out as follows. 


When x = 7, the gradient of the graph of the sine 
function seems to be about the same as the gradient 
of the line y = —zx; that is, it seems to be about —1. 
As x increases, the graph gradually gets less steep — 
that is, its gradient takes negative values of smaller 
and smaller magnitude, until x = 37/2, when the 
gradient seems to be zero. The gradient changes 
slowly when z is only a little larger than 7, but 
changes more rapidly as x gets closer to 37/2. 


In other words, for x between m and 37/2 the value 
of the derivative seems to increase from —1 to zero, 
slowly at first, but then more rapidly. 


When z increases from 37/2, the gradient starts at 
zero, then it becomes positive. At first the graph is 
not very steep — that is, the gradient has small 
positive values — but it becomes steeper and steeper 
— that is, the gradient takes larger and larger 
positive values. Eventually, when æ = 27, the 
gradient seems to be about the same as the gradient 
of the line y = x; that is, it seems to be about 1. 
The gradient increases fairly rapidly when g is only 
a little larger than 37/2, but increases more slowly 
as x gets closer to 27. 


So, for x between 37/2 and 27, the value of the 
derivative seems to increase from 0 to 1, fairly 
rapidly at first, but then more slowly.) 
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Solution to Activity 2 

(a) f(x) = sing + cos z, so 
f'(x) = cos x — sin z. 

(b) glu) = u? — cos u, so 
g'(u) = 2u — (— sin u) 


= 2u + sinu. 
(c) P = 6tanð, so 
dP 3 
u“ 6 sec* 0. 
(d) r = —2(1+ sin ¢), so 
dr 
— = _9 
T (0 + cos œ) 
= —2cos@. 


Solution to Activity 3 
(a) f(x) = e” + 1n z, so 
1 
f'(x) =e? +. 
i 
(b) h(r) =r — cosr — 3lnr, so 


iC ae E 
- 


Se 
p 
1 
(c) v=_tint so 
dv 1 1 
dt È t 
1 t 
PTE 
t—1 
t2 
(d) w = 5 — 3e”, so 
dw 3e" 
— = —3e 
du 


Solution to Activity 4 


(a) The function is k(x) = x tan z. 
Let f(x) =x? and g(x) = tan z. 


Then f'(x) = 3x? and g'(x) = sec? z. 


By the product rule, 
ki(x) = f(x)g'(x) + g(x) f'(x) 
= z? sec? x + (tan x) x 3x” 
= g? sec? x + 3x" tan z 
= x*(xsec? z + 3tanz). 
(b) The function is k(x) = ze”. 
Let f(x) = x and g(x) = e”. 
Then f'(x) = 1 and g'(x) = e”. 
By the product rule, 
k'(x) = f(z)g' (£) + g(x) f'(x) 
= re +e” x1 
= ge” + e” 
(x + 1)e”. 


Solution to Activity 5 
The function is k(x) = 2? x xè. 
Let f(z) = xz? and g(x) = 2°. 
Then f'(x) = 22 and g'(x) = 3a’. 
By the product rule, 
k'(x) = f(z)g'(£) + g(x) f'(x) 

= 2? x 32? + 2? x Qe 

= 324 + 224 

= 5g. 
This is the same answer that you obtain by 
differentiating k(x) = zë directly, using the usual 
rule for the derivative of a power function. 


Solution to Activity 6 
(a) k(x) = (3x? + 2x + 1)e”, so, by the product rule, 
k'(x) = (8a? + 2x + 1)e* + e” (6z + 2) 
= (3x? + 22 + 1+ 6r + 2)e* 
= (3a? + 8x + 3)e”. 


(b) k(x) = sin z cos z, so, by the product rule, 
k'(x) = (sin z)(— sin x) + (cos x)(cos x) 
= cos” x — sin? z. 
(c) k(z) = zsin z, so, by the product rule, 
k'(z) = zcosz + (sin z) x 1 


= z cos z + sin z. 


(d) v = (2t? — 1) cost, so, by the product rule, 
d 
= = (2t? — 1)(— sint) + (cost) x 4t 
= (1 — 2¢”) sin t + 4t cost. 


Solutions to activities 


(e) m = (u? +3) Inu, so, by the product rule, 


d 1 
TL = (u? +3) x Z + (Inu) x 2u 
2 
=s- Tiano 
u 


u u 
u? +3 + 2u? Inu 
u 
u2(1+2Inu)+3 
—— 
(£) y= /esinez = x!/? sin z, so, by the product 
rule, 


1 
— = 7/2 cosg + (sin £) x 5 r71? 


gs 
= y £ cos r + ——= 
2,/T 


_ 2xcosx+sin x 


= NA 


Solution to Activity 7 

If y = xe”, then, by the product rule, 
dy _ 
TE 

= g(x + 2)e”. 

Substituting x = —1 gives 

dy —1 1 

ma (—1)(-14+ 2)e™ = -7 

So the gradient is —1/e, which is approximately 

—0.37. 

(The graph of y = x?e® is shown below. You can see 


that the gradient at the point with x-coordinate —1 
does seem to be approximately —0.37.) 


r e? + e” x Wer 
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Solution to Activity 8 (c) r+1 


m so, by the quotient rule, 
ae: e” z 
(a) The function is k(x) = Pi dz (r=1)x1-(r+1)x1 
Let f(x) = e” and g(x) = x. dr (r — 1)? 
Then f'(x) = e” and g'(x) = 1. r—=l-r-1 
So, by the quotient rule, = (r — 1} 
wa) = = SOLO) - Hel) 8 
(g(x))? ~ (r=)? 
_ reve" x I 0 
ee (d) m= Tp 8% by the quotient rule, 
_ (@ ae dm _ (cos) x 1— 6(~sin8) 
_ r dð (cos 0)? 
EN x 
(b) The function is k(x) = aal _ cos 0 + sin 0 
Let f(x) = z? and g(x) = 22 + 1. cos? 0 


Then f'(x) = 3x? and g'(x) = 2. 


À 1+ @tand 
So, by the quotient rule, "ae 
k'(x) = = gla) f(x) — f(x)g'(x) (The final expression here is obtained by 
(g(x) )? dividing the top and bottom of the previous 
_ (22+ 1) x 3r? — r? x 2 expression by cos @. The previous expression 
(2x + 1)? would also be an acceptable final answer.) 
= 2 Int 
= ee (e) p(t) = aa so, by the quotient rule, 
r+ 
(62 +3- 2z)z? Py : — (Int) x 2 
AS 1 — 
(2x +1)? p (t) = ez 
_ (4a + 3)a* t—2tInt 
Qt?" =a 
i Ia 1—2lnt 
Solution to Activity 9 = = 
e” : 
k= a O Py She aoti Fal Solution to Activity 10 
f@)= a*e* — e” (3x?) (a) By the quotient rule, 
(z3)? d C= d (2) 
_ 2 (a — 3)e* da 7 ~ ae \sing 
gh p z (sin x)(— sin x) — (cos x) (cos x) 
= Eo (sin x)? 
z 
sin? x + cos? x 
=| ee eae ea 
(b) g(u) = ZS so, by the quotient rule, sin? z 
eu 
1 
jee (e”)x1-—(u-— Te" =- 
g (u) _— (ev)? sin . 
B (1—u+1)e” = — COSEC T. 
~ R 
_2-u 
sora 
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(b) By the quotient rule, 


2 (sec z) = = d 
dx ~ dg \cosx 


_ (cosx) x O— 1x (—sinz) 


(cos x)? 


sin x 


cos? x 


O 1 sin x 
~ \cosa COS x 


= sec g tan z. 


(c) By the quotient rule, 


L (cosee) = È (= 
dx eee dx \sinx 


(sinx) x 0— 1 x cosg 
Oo a 


(sin x) 


1 ( Cos £ ) 
sin x sin x 
= — cosec x cot T. 


Solution to Activity 11 


(a) y = Infa) 

so y = lnu where u = zt. 
(b) y = sin(x? — 1) 

so y = sin u where u = z? — 1. 
(c) y =cos(e*) 

so y = cosu where u = e”. 
(d) y = cos? z 

so y = u? where u = cos 7. 
(e) g=e"% 

so y = e” where u = tan z. 
(£) y= vlonz 

so y = yu where u = ln z. 
(8) y = (x + V2) 

so y = u? where u = g + ya. 
(h) y= Vo? -x-1 

so y = #/u where u = 2? — x — 1. 
@ y= 

so y = e” where u = yz. 


Solutions to activities 


Solution to Activity 12 
(a) y=e°*, so 
y = e” where u = 52. 


This gives 
dy du 
— = e” — = 5 
du ^? da 
By the chain rule, 
dy dy du 
dx du dx 
=e"x5 
=e x5 
=he*, 


(b) y = sin(2zx? — 3), so 


y = sinu where u = 22? — 3. 


This gives 

dy du 

A = COSU, in = Ag. 
By the chain rule, 

dy dy du 

dx du dx 


= (cosu) x 4x 
= (cos(2x? — 3)) x 4a 
= 4x cos(2x? — 3). 

(c) g=sm (=), so 


. x 
y = sin u where u = ri 


This gives 
dy _ cos u a 
du " de 4 
By the chain rule, 
dy dy du 
dx du dz 
= (cosu) x 
E 4 
=3% (3) 
=3s (7) 


(d) y = cos(e”), so 


y = cosu where u = e”. 


This gives 
dy ʻi du 5 
— = —sinu, — =e 
du ’ dæ 
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By the chain rule, 
dy dy du 
dr du da 
= (— sinu) x e” 
= (—sin(e”)) x e” 
= —e” sin(e”) 
(e) y=In(sinz), so 


y = lnu where u = sin z. 


This gives 
dy 1 du 7 
— = —, — =cCosT. 
du u’ dg 
By the chain rule, 
dy dy du 
dx du dz 
1 
= — X COST 
u 
1 
= —— X COST 
sin £ 
_ cose 
sing 
= cot z. 
(f) y = sin? z = (sin x)’, so 
y = u? where u = sin z. 
This gives 
d d 
7 = Su", T = COS T. 
u x 
By the chain rule, 
dy dy du 
dx du dz 
= 3u? cos x 


= 3(sin x)? cos x 
= 3sin? z cos z. 
Solution to Activity 13 
(a) f(x) = cos(4x), so, by the chain rule, 
f'(x) = —sin(42) x 4 
= —Asin(4z). 
(b) f(x) = e?"+1, so, by the chain rule, 


(c) r(x) = cos? z, so, by the chain rule, 
r(x) = (5cos* x)(— sin x) 


= —5 sin z cos‘ x. 
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(Remember that cos°® x means (cos x)°.) 


(d) g(x) = ln(z? +x +1), so, by the chain rule, 


g'(x) = x (22 + 1) 


1 
x27+ar+1 
_ 2e+1 
— ett et 1 
(e) v(t) =sin(e’), so, by the chain rule, 


v' (t) = cos(e’) x e 


= æ cos(e’). 
(£) v =In(r°), so, by the chain rule, 
dv 1 4 
dr = r5 x or 
5 
= 


(Here’s an alternative way to differentiate this 
function, without using the chain rule: 


v =In(r°) =5Inr, so, by the constant multiple 


rule, 
du PP - +5 
T 


(g) r = tan? 0, so, by the chain rule, 
T = (2 tan 0) x sec? 


= 2 tan ĝ sec? 0. 


(h) R= et®™4, so, by the chain rule, 


dR 
dq = eng sec? q. 

(i) A= (1+ p°)? so, by the chain rule, 
dA 
— = 9(1 + p°)? x2 
ape ee 


= 18p(1+ p°)’. 
Solution to Activity 14 
(a) f(x) = Vina = (Inz)'/?, so, by the chain rule, 
fi(o) = iai 


1 

~ QeV/Ina 

1 
(b) p(w) = ws +7) > (w? + 7)~4, so, by the 
chain rule, 
p(w) = —4(w? + 7)~° x 3w? 

o 12w? 

(w3 + 7)5- 


(c) f(x) = cos(/x) = cos(xt/?), so, by the chain 


rule, 
f'(z) = —sin(2'/*) x r7? 
sin(/x) 
— F 
(a) fle) = = (6*)-1/2 ¢-*/2, 50, by the 
chain rule, 
fie) =e??? x (=}) 
= —1¢-2/? 
O 1 
=- 
(This solution uses the index laws (a™)” = a™” 


and a™/” = Yar.) 


1 
(e) a) = Gas 
so, by the chain rule, 
a' (t) = —4(2 — 3t — 3t?)~*/8 x (—3 — 6t) 
1+2t 
(2 — 3t — 3t?)4/3° 


(£) C(p) = e!/P” =e”), so, by the chain rule, 


C' (p) =e *) x (—2p7?) 
-3 61/2? 


= (2 — 3t — 3?) 1, 


2el/P° 


p 

Solution to Activity 15 
(a) k(x) = cos(7x), so 

k'(x) = —7sin(7z2). 


Solutions to activities 


w(x) =. 


w r 
(Here’s an alternative way to differentiate this 
function: 


w(x) = ln(7x) = ln 7 + lng, so 


w(x) =0+ 2 = = 


(h) k() = tan(3¢), so 

k'(¢) = 3sec?(3¢). 
Solution to Activity 16 
(a) k(x) = cos(7x + 4), so 

k' (x) = —7sin(7x + 4). 


Solution to Activity 17 
(a) g(x) = (2+1) +1) 
=e +r’ +r +l, 
so 
g'(x) = 5f + 38x? + 2x 
= «(5x + 3x + 2). 
(An alternative method is to use the product 
rule.) 
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T 


(b) f(x) = (@ — 28 SO 
(2-2)? x 1- ro ((x — 2)3) 
(x — 2)° 
(by the quotient rule) 
(x —2)3 x 1— xx 3(x-— 2)? x 1 
(x — 2)° 
(by the chain rule) 
(x — 2)° — 32(x — 2)? 
(x — 2)° 
(a — 2) — 3a 
m- 
== 2 
= Ge 98 
2(x + 1) 
(z — 2)*" 


(c) g(x) = cos(x ln x), so 


f(z) = 


g'(x) = — sin(x ln z) L (xln zx) 


dz 
(by the chain rule) 
= —sin(zln z) (« x id + (Ina) x i) 
t 
(by the product rule) 
= —sin(zln z)(1 + ln z). 
(d) h(x) = e*/? sin(3z), so 
d d 
/ = ata M a : — {2/2 
h(a)=e T (sin(3x)) + sin(3z) T (e ) 
(by the product rule) 


= e*/? x 3cos(3x) + sin(3x) x le”/? 
(by the chain rule) 


= 3e7/? cos(3x) + le”/? sin(3x) 


= te®/?(6 cos(3x) + sin(3z)). 
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d d 
ou T (cos(4u)) — cos(4u) ae (e3") 
(ers 2 
(by the quotient rule) 
e?" x (—Asin(4u)) — cos(4u) x 3e3% 
(e34)? 
(by the chain rule) 
—4e?" sin(4u) — 3e?” cos(4u) 
eou 


—e>"(4sin(4u) + 3cos(4u)) 
eou 
Asin(4u) + 3 cos(4u) 


eau 
(£) f(z) = ef"), so 
f'(a) =e”) x 5cos(5x) 
(by the chain rule) 
= 5e*"(5") cos(5x). 
(g) h(x) = cos?(x?), so 
h'(x) = 3 cos? (x?) < (cos(x?)) 
(by the chain rule) 
= 3cos?(a#?) x (—sin(x?)) x 2x 
(by the chain rule again) 


= —6x cos? (x°) sin(x’). 


e72t 
(haas z = te? so 
z'(£) = 4 x (—2e7*") 
2 
= —i e722 
5 


(i) G(x) = 3sin z cosg + 2sin(3x), so 
G' (x) = 3((sin x)(— sin x) + (cos x) (cos x)) 
+ 2(cos(3x)) x 3 
(by the product rule) 
= —3 sin? x + 3 cos? z + 6 cos(3z). 
(You can simplify this answer further by using 
the trigonometric identity 
cos(20) = cos? 6 — sin? 6: 
— 3sin? z + 3cos” x + 6 cos(3z) 
= 3(— sin? x + cos? x) + 6 cos(3z) 
= 3 cos(2x) + 6 cos(3z).) 


(j) p(t) = te sint = t(e’ sin t), so 
d 
p(t)=t T (e* sint) + (e sint) x 1 
(by the product rule) 
= t(e* cost + (sin t)e*) + e sint 
(by the product rule again) 
= te’ cost + te’ sint + e’ sint 
=e'(tcost +tsint + sint). 


(Alternatively, you could start by writing 
p(t) = (te’) sint, and again apply the product 
rule twice.) 


(k) r =sin(26) cos 0, so 


= sin(26)(— sin 8) + (cos 8) x 2c0s(26) 


(by the product rule) 
= 2cos 0 cos(26) — sin 0 sin(290). 
(1) have", so 
E ne EE 
dz 
(by the product rule) 
= 3ze** + e?” 
= (3z + l)e*. 
sin /u _ sin(u!/?) 
yu — u2 


d 
dy u? a (sin(ul/?)) — sin(ul/?) x 4471/2 


du u 
(by the quotient rule) 


ul/? cos(ul/?) x $u7t/? — sin(ul/?) x u71? 


u 


(m) v = 


, SO 


(by the chain rule) 


cos(u!/?) — u7 t? sin(u!/?) 
2u 
ut/? cos(ul/?) — sin(u!/?) 
= 243/2 
_ yucos yu — sin yu 
7 Qu /u 


Solutions to activities 


Solution to Activity 18 


(a) g(x) = e~**, where k is a constant, so 
g(x) = —ke—**. 
(b) f(a) = xcos(rx), so, by the product rule, 
f'(x) = x x (—7sin(mz)) + (cos(rx)) x 1 
= -ng sin(7x) + cos(mx) 
= cos(mz) — mz sin(az). 
h = sin? (a0), where a is a constant, so, by the 
chain rule, 
dh 
u 2sin(a@) x a cos(a) 
= 2a sin(a) cos(a0). 
(You can use the trigonometric identity 
sin(20) = 2 sin 0 cos @ to write this answer above 
in a neater form, namely a sin(2a0).) 
(d) y = cos(3x) + c, where c is a constant, so 
d 
z = (—sin(3z)) x 340 
= —3sin(3z). 
Solution to Activity 19 
lnt 
The function is p(t) = r =a lat: 
So, by the product rule, 


fiat? z + (Int)(—2t-3) 


1 2lnt 

BE B 
ll —2Int 
= 3 


Solution to Activity 20 


Let the length of the sides of the squares be x 
(in m), as shown below. The value of x must be 
between 0 and 3. 
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Let the volume of the box be V (in m3). 


L=29 


Then the height of the box will be xz, and the length 
and width of the box will each be 1 — 27. Hence 

V =2(1 —2z)?. 
We have to find the value of x, between 0 and $, 
that gives the maximum value of V. 
V is a polynomial function of x, so it is continuous 
on its whole domain and differentiable at every 
value of x. So the maximum value of V must occur 
either at an endpoint of the interval [0, $], or at a 
stationary point. 


The formula for V is 


V = z(1 — 22)? 
= 2(1 — 4x + 42) 
= z — 4r? + 4r’, 
sO 
Y 4 Re ie 
dx 


At a stationary point, dV/dz = 0, which gives 
1 — 8z + 122? =0 
1227 — 8r +1 = 0 


(62 — 1)(2x — 1) = 0 


2i _1 
6 = 6 or T = 3 
So the stationary points occur when z = t 
1 


and 
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The endpoints of the interval are 0 and $, so the 
maximum value of V occurs when x = 0, z = i or 
=L 

When x = 0 the height of the box is zero, and when 
L= 4 its length and width are zero, so in both these 
cases the volume V will be zero. When x = 2, the 
length, width and height of the box are all positive, 


so the volume V will also be positive. 


So the maximum value of V is achieved when x = z. 


Hence the sides of the squares should be im; that 
is, about 16.7 cm. 

(A graph of V against x, for x between 0 and S, is 
shown below. You can see that the maximum value 
of V does indeed seem to occur when zg is 
approximately z SO lOG] 


V =2(1 - 2x)? 


volume, 
> 
=} 
A 
L 


02 03 04 05 
side length of squares, x (m) 


0.1 


(Note also that an alternative way to differentiate 
the formula for V, rather than first multiplying out 
the brackets, is to use a combination of the product 
rule and the chain rule, as follows: 

dV d 


a i (x(1 — 2x)°) 
= 2+ ((1 — 22)?) +(1- 2r) x1 


x x 2(1 — 2x) x (—2) + (1 — 22)? 
= —4gz(1 — 2x) + (1 — 2x)? 

= (1 — 2x)(—4r + 1 — 2x) 

= (1 — 2x)(1 — 6x).) 


Solution to Activity 21 


The volume of the channel is the area of its 
cross-section times its (fixed) length. So to achieve a 
channel of maximum volume, we have to achieve the 
maximum area of the cross-section. 


Let the angle between the two sides of the v-shape 
be 6, as shown below. The value of 0 must be 
between 0 and 7. 


Let the area of the cross-section of the channel be A 
(in m?). 


0.5m 0.5m 


- 


Each side of the v-shape has length $m, so, by the 
formula A = tab sin 0 for the area of a triangle, 


A= 4 x i x $ sind = $ sin 6. 
We have to find the value of 0, between 0 and 7, 
that gives the maximum value of A. 


Since the sine function is continuous on its whole 
domain and differentiable everywhere, so is the 
variable A as a function of 6. So the maximum value 
of A must occur either at an endpoint of the interval 
[0, 7], or at a stationary point. 


The formula stated above for A gives 
dA 
u 3 cos 6. 
At a stationary point, dA/d@ = 0, which gives 


(since @ is in the interval [0,7]). 
So the stationary point occurs when 0 = 7/2. 


The endpoints of the interval are 0 and 7, so the 
maximum value of A occurs when 0 = 0, 0 = 7/2 or 
J=. 

When 0 = 0 the width of the channel is zero, and 
when 0 = 7 its height is zero, so in both these cases 
the area A will be zero. When 0 = 7/2, the width 
and height of the channel are both positive, so the 
area A will also be positive. 

So the maximum value of A is achieved when 

Q= n2: 

That is, to give a channel with maximum volume, 


the angle between the sides of the v-shape should be 
m/2, which is 90°. 


Solutions to activities 


(A graph of A against 0, for 6 between 0 and 7, is 
shown below. You can see that the maximum value 
of A does indeed seem to occur when @ is 
approximately 7/2.) 


ee A 

= 

x 0.154 A= ġsinð 

a 

fav) 

0.05 4 

T 7 
J angle, 6 


(Notice that in fact you could have answered this 
question without using calculus, because, from what 
you saw in Unit 4, the maximum value of sin 0 for 
values of 0 between 0 and m occurs when 0 = 7/2. It 
follows that the maximum value of A = + sin 0 for 
values of 0 between 0 and m also occurs when 
G=7/2.) 


Solution to Activity 23 


(a) (You need to do this on a computer.) 


(b) 


N 
Xy 


(The symbol consisting of two short slant lines on 
the y-axis of this graph indicates that part of the 
axis has been omitted.) 


(c) The value of x that gives the minimum value of 
f(x) seems to be about 0.6. 


(d) The derivative of f is 


f'(a) l 


aC 
 8f/ttae 16° 
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(e) The solution of the equation f'(x) = 0 is 
x = 0.577 (to 3 s.f.). 


(f) The man should join the road 0.58 kilometres 
(to the nearest 10 metres) along from the point 
that is closest to his initial position. 

(Because the graph of f indicates that the minimum 

value of f(x) occurs at a local minimum rather than 

at one of the endpoints of the interval [0,2], in the 

solution above we don’t consider the values of f (0) 

and f(2).) 

(Details of how to use the CAS for this question are 

given in the Computer algebra guide, in the section 

‘Computer methods for CAS activities in 

Books A—D’.) 

By-hand solution 

The problem in this question can be solved by hand 

as follows. 


The function to be minimised has the rule 
_ Wee 2a 
I(t) ="43— + 5 
=4(1+27)'/?+ 22-2). 
The variable x can take values between 0 and 2. 


The function f is continuous on its domain (0, 2], 
and differentiable at every value of x in this interval, 
except at the endpoints 0 and 2. (Remember that a 
function can’t be differentiable at an endpoint of its 
domain.) (These facts can be deduced from the fact 
that f is a simple combination of functions that 
have these properties. You can learn more about 
making such deductions in later modules. For now, 
you can see from the graph of f that it appears to 
have these properties. ) 

Hence the minimum value of f on the interval [0, 2] 
occurs at a stationary point of f or at an endpoint 
of the interval. 


Differentiating f gives 
f'(a) =4x 404-2?) 1? x 22 + 4 x (-1) 
1 


Ri 
8V1 +z? 16° 
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At a stationary point, f'(x) = 0, which gives 


x 1 
8V1+ x? 16 
x 1 


8V1+a2 16 
l6x = 8y 1 + z? 


(2r)? =14 2? 
4r? =1+4+ 2? 
3a? = 1 
Poe 

3 

1 
x 


(Here we have used the fact that x > 0.) 


So the only stationary point of f is x = 1//3. The 
value of f at this stationary point is 


(3) Se 


+5 2V3-1 
8 16/3 


oe 1 

8 16/3 

= 2 23-1 

TV3 6v3 

_ _2v3_ , 2v3v3~ v3 
8V3V3 16V3V3 

6- v3 


24 48 

4/3 6-7v3 

~ 4&8 48 

_6+3v3 

© 48 

_ 243 aias 
16 


The values of f at the endpoints of the interval [0, 2] 
are as follows: 


fe 
~~ 8 16 
S g a en 
8 8 4 ` 


vB, _ v5 


= — +0= — £0.28. 
8 a 8 


So the minimum value of f(x) is achieved when 

x = 1/V/3 = 0.577 (to 3 s.f.). Hence, as was found 
above using a computer, the man should join the 
road 0.58 kilometres (to the nearest 10 metres) along 
from the point that is closest to his initial position. 


Solution to Activity 24 


(a) We have y = cos~! x, so 


z = cosy. 
Therefore 
da . 
dy = — siny. 
By the inverse function rule, 
dy _ 1 
dr siny’ 


provided that sin y Æ 0. 
The identity sin? y + cos? y = 1 gives 


siny = +y 1 — cos? y = +y 1 — x?. 
The + sign applies here, because y takes values 
only in the interval [0, 7] (since y = cos™ t (x)) 
and so sin y is always non-negative. Hence 


siny = y1 -— z?. 


Therefore 
dy _ 1 
d£ Vi- a?’ 


provided that x Æ +1. 
(b) We have y = tan™t z, so 

x = tany. 

Therefore 

Z 2 

T = sec” y. 

By the inverse function rule, 
dy 1 
dz  sec?y 


The identity tan? y + 1 = sec? y gives 
sec? y = 1 + tan? y = 1 + x°. 


Therefore 
dy 1 
d£ 142?" 


Solutions to activities 


Solution to Activity 25 


(a) f(x) = sint (3x), so, by the chain rule, 
1 
‘(c) = ———— x3 
f(x) TEE 
3 


VI — 922 


(b) f(x) = e? cos! x, so, by the product rule, 


1 
==) + (cos~* x)e* 
x 


1 
= (cos! L— ==) e”. 
Solution to Activity 26 
(a) F(x) = x°, so F' (x) = 2x. 
(b) F(x) = 27 +3, so F’(x) = 22. 
(c) F(z) = 2? — 2, so F’(x) = 2a. 


Solution to Activity 27 


(a) Since 

< ($sin(2x)) = $ x 2cos(2x) = cos(2z), 
it follows that F(x) = 4 sin(2z) is an 
antiderivative of the function f(x) = cos(2z). 
The indefinite integral of f(x) = cos(2z) is 


F(x) = $sin(2z) + c. 


An antiderivative of f(x) = cos(2x), other than 
the antiderivative in part (a), is 

F(x) = $sin(2z) + 1. 

(You probably chose a different antiderivative. 
Any function obtained by setting the constant c 
in the indefinite integral F(x) = $ sin(2x) + c 
equal to a particular number will do.) 


Solution to Activity 28 
(a) The indefinite integral of f(x) = x° is 
F(a) = 2 +c. 


10 
1 
(b) The indefinite integral of p(u) = — = u-® is 
u 
l -2 


(c) The indefinite integral of f(x) = xz?’ is 
F(z) = 375/3 ees 
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(d) The indefinite integral of f(x) = Yr = x'/4 is 
P(g) = 45/4 aa 

(e) The indefinite integral of g(t) = t-?/3 is 
G(t) = 3t +c. 


1 
(£) The indefinite integral of b(v) = T Say Zis 


B(v) = W? +e =2yVv +c. 
Solution to Activity 29 
(a) The indefinite integral of f(x) = x° is 
F(a) = xf +c. 


(b) Three different antiderivatives of f are 


F(x) = zô, 
F(a) = xf +1, 
F(a) = $x? — 2. 


(You probably chose different examples.) 


Solution to Activity 30 
(a) The indefinite integral of f(x) = 82° is 
F(x) =8 x tzt +c 


= 2z +c. 
(b) The indefinite integral of f(x) = 7 is 
F(a) =7xz+c. 


(c) The indefinite integral of 
f(a) = VT — r? = z? — z? is 


F(x) = 2g3/2 - 4z’ +c 
= £(2x3/? —x*)+e. 
(d) We have 
f(x) = (x + 2)(z — 5) 


= z? — 3x — 10, 
which has indefinite integral 
F(x) = 4x? — 32? — 10r + c. 
(e) We have 
g(t) = (t+ 1) 
= +98 41, 
which has indefinite integral 
Gt) =40 +P Htc. 
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(£) We have 
x? +1 1 1 
g(a) == tare, 
which has indefinite integral 
G(x) = -xt — 4r +c 
a oe 
=] E = Se FE: 


f(r) = ¥r (1012 = =) 
= r1/3 (10r? -= r7?) 
= 10r7/3 — r753 
which has indefinite integral 
F(r) = 10(37r?°/8) — (3r?) +e 
= 3710/3 4 3p=2/3 n 


= 3r=2/3(1 + 2r*) +e. 


Solution to Activity 31 
(a) The function is f(x) = x + 2. Its indefinite 
integral is 
F(a) = $27 +22 +c. 
Using the fact that F(1) = 3 gives 
5xX1?74+2x1l+c=3 


Hence the required antiderivative is 


F(a) = $07 +2042. 


(b) The function is f(x) = 1/2? = x7? with domain 
(0,00). Its indefinite integral is 


l 1 
Using the fact that F(3) = —} gives 
1 S 
2x32 9 
+ 
J es C= —— 
18 9 
1 1 1 
c=--4+—=-—. 
9 18 18 
Hence the required antiderivative is 
1 1 
F(x) =-~>~- — 
(2) =—9 3 — ig 
94+ x? 


Solution to Activity 32 


(a) 


The given equation for v in terms of t is 
v= int. 

Hence the equation for s in terms of t is 
s=4x Gt +c, 

that is, 
s= at +c, 

where c is a constant. 

The marble starts rolling from a position 

0.3 metres down the slope, so when t = 0, 

s=0.3. 


Substituting these values of s and t into the 
equation above gives 

0.3 = 3 x 0? +c 

c= 0.3. 
So the equation for s in terms of t is 

s = pt? +0.3. 
Substituting t = 4 into the equation from 
part (a) gives 

s = $ Xx 4° + 0.3 = 0.8 + 0.3 = 1.1. 
That is, the displacement of the marble from 


the top of the slope after it has been rolling for 
4 seconds is 1.1 metres. 


Solution to Activity 33 


(a) 


The equation for a in terms of t is 
a = —9.8. 


Integrating this equation gives the following 
equation for v in terms of t: 


v = —9.8t + c, 
where c is a constant. 


At the start of the motion, the velocity of the 
ball is 12ms™t. That is, when t = 0, v = 12. 


Substituting these values into the equation for v 
above gives 

12 = —9.8 x 0 +c, thatis, c= 12. 
So the equation for v in terms of t is 

v = 12 — 9.8t. 
Integrating the equation found in part (a) gives 
the following equation for s in terms of t: 

s = 12t — 9.8 x 4t? +b, 


Solutions to activities 


that is, 
s = 12t — 4.9t? + b, 
where again b is a constant. 


At the start of the motion, the displacement of 
the ball is Om. That is, when t = 0, s = 0. 


Substituting these values into the equation for s 
above gives 
0=12x0-49x07+b, that is, 
So the equation for s in terms of t is 
s = 12t — 4.91”. 


b= 0. 


(c) When t = 1, 


v = 12 — 9.8 x 1 = 12 — 9.8 = 2.2, 
and 
s= 12 x 1 — 4.9 x 1? = 12 — 4.9 = 7.1. 
So the velocity and the displacement of the ball 


1 second after it was thrown are 2.2ms7! and 
7.1m, respectively. 


(d) When the ball has fallen back to the point from 


which it was thrown, s = 0. Substituting this 
value of s into the equation found in part (b) 
gives 

0 = 12t — 4.91? 

12t — 4.9t? = 0 

t(12 — 4.9t) =0 

t = 0 or t = 12/4.9 = 2.4 (to 1 d.p.). 
So the ball takes about 2.4 seconds to fall back 
to the point from which it was thrown. 


(The displacement-time graph for the ball is 
shown below. You can see that the ball first 
rises and then falls, as expected. You can also 
see that it reaches a height of about 7m after 

1 second, and that it returns to the point from 
which it was thrown after about 2.4 seconds, as 
you would expect from the answers found in 
parts (c) and (d).) 
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84 s = 12t — 4.91? 


displacement, s (m) 


Solution to Activity 34 


(a) The given equation for m in terms of q is 
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m = 250 — 0.59. 
Since marginal cost is the derivative of total 
cost, the equation for t in terms of q is 

t = 250g — 0.5 x 4? + c, 
that is, 

t = 250q — 0.250? + c, 
where c is a constant. 
According to the information given in the 
question, when q = 80, t = 30 400. 
Substituting these values into the equation 
found for t above gives 

30 400 = 250 x 80 — 0.25 x 80? + c 

30 400 = 18 400 + c 

c = 12000. 
Hence the equation for t in terms of q is 

t = 250g — 0.25q7 + 12000. 
The total weekly cost of producing 160 tonnes 
of fertiliser each week is 

£(250 x 160 — 0.25 x 160? + 12000) 

= £45 600. 
The question states that the total weekly cost of 
producing 80 tonnes of fertiliser each week is 


£30400. Hence the cost per tonne of producing 


80 tonnes of fertiliser each week is 
4 
£30400 — £380. 


80 


It was calculated in part (b) that the total 
weekly cost of producing 160 tonnes of fertiliser 
each week is £45600. Hence the cost per tonne 
of producing 160 tonnes of fertiliser each week is 
£45 600 
160 
(The graph of t against q is shown below. You can 
see that the total weekly cost of producing 
160 tonnes of fertiliser each week is about £45 000, 
as expected.) 


= £285. 


t = 250g — 0.25q? + 12.000 


total cost, t (£) 


25 50 75 100 125 150 175 200 
quantity, q (tonnes) 


Solution to Activity 35 
An antiderivative of f is 
F(a) = 8e7/®. 
So the change in the quantity from « = 2 to x = 5 is 
F(5) — F(2) = 8e°/® — 8e?/8 
=A67 (to 3 sf) 


Solution to Activity 36 
An antiderivative of f is 
_ Bg E 342 
F@)=3t=2 «if =3t- 37. 
(a) The change in the displacement (in metres) of 
the object from t = 0 to t = 1 is 
F(1) — F(0) = (3x 1-2 1’) 
— (3x 0-2 x 0°) 
= 2.25. 


That is, the object travels 2.25m in that time 
(in the positive direction). 


(b) The change in the displacement (in metres) of 


the object from t = 4 to t= 5 is 
F(5) — F(4) = (3x 5-3 x 5?) 
—(3x 4-3 x 4?) 
= (15 — Ẹ) — (12 — 12) 
= —3.75. 


That is, the object travels 3.75m in that time 
(in the negative direction). 


Solution to Activity 37 


4 
(a) The function f(x) = ; has indefinite integral 
F(x) =4In|z| + c. 


(b) The function 


x+i1 1 1 1 z 
f(a) = = 


has indefinite integral 


1 
F(x) =In|2| —27'+e¢=In|a2| -=+ c. 
z£ 
(c) The function 
1 
f(x) = = 
has indefinite integral 


F(x) =$lnz+e. 


(x > 0) 


(The answer F(x) = 4In|z| +c is also correct, 


= 2 
but can be simplified, as x takes only positive 
values. ) 


Solution to Activity 38 


(a) The indefinite integral of h(@) = 3 cos 0 + 4sin 0 


is 
H(0) = 3sin 0 + 4(— cos 0) + c 
= 3sin 0 — 4 cos + c. 


(b) The indefinite integral of g(¢) = 6 — 5 cosec? ¢ is 


G(¢) = 66 — 5(— cot $) + c 
= 66+ 5cot o +c. 
(c) The indefinite integral of f(x) = 4 is 
T 
F(x) = Tln|e| +c. 
(d) The function is 


1 
j=, 
g(t) F 
which can be written as 
H= 1 7 1 
a nT 


Solutions to activities 


Hence its indefinite integral is 
1 
The function is 
1 
= — (x > 0), 
fe) == (e>0) 
which can be written as 
1 1 
f(a) =— x — (x > 0). 
wT. È 
Hence its indefinite integral is 


1 
F(z)=-Inz+e. 
T 


3 
The indefinite integral of f(x) = is 
1+ 2? 
F(x) = 3tan™t z +c. 
1 
The indefinite integral of h(t) = ———= is 
g U= ae 


H(t) = sint t+ c. 
(Or, alternatively, H(t) = —4 cos™! t + c.) 


The function is 


(x) : 

r) = —— 

P 5 + 5a’ 
which can be written as 


ia! 
BUIT ae Oe 


Hence its indefinite integral is 
P(x) =% tan tate. 


The function is 


1 
r) = ——, 
g(x) ae 
which can be written as 
1 
r) = >> 
TO fare 
that is, 
(2) = —= 
t =a: 
# 2y 1 — r? 


Hence its indefinite integral is 
G(x) = 4 sint g +c. 


(Or, alternatively, G(x) = -4 cos-tx + c.) 
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(j) The function is 
g(x) = 5(2 — 3)(2z — 1), 
which can be written as 
q(x) = 5(2x” — 7x + 3). 
Hence its indefinite integral is 
Q(x) = 5(2 x $x? — 7 x $47 + 3r) +c 
= £r? — Ba? + 15r +c. 
(It could also be written as 
Q(x) = ğz(4r° — 21x + 18) + c, 
for example.) 


(k) The function is 


xr—2 

f(x) — £ 3 
which can be written as 

2 

fl) =1-=. 

T 


Hence its indefinite integral is 
F(x) = x — 2ln |z| + c. 
(1) The function is 
g(x) = “= 
which can be written as 
g(x) = 17? — 23, 


Hence its indefinite integral is 


7 


1 
G(x) = =z! — 2 x z +é 


(m) The indefinite integral of f(x) = 8e” is 
F(x) = 8e” +c. 
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(n) The function is 


f(x) = ee, 
which can be written as 
f(e) Sexe. 
Hence its indefinite integral is 
F(x)=exe”+c 
= gE g. 
(You can integrate the expression e x e” by 


using the constant multiple rule for 
antiderivatives, since e is a constant.) 


The indefinite integral of r(ġ) = — cosec ¢ cot d 


R(¢) = —(— cosec ġ) + c 
= cCosec ġ + c. 

The function is 

r(8) = sec 0(sec 8 + tan 0) 
which can be written as 

r(0) = sec? 0 + sec 0 tan 8. 
Hence its indefinite integral is 

R(6) = tan 0 + sec 0 + c. 
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Introduction 


In this unit you’ll continue your study of integration. In Unit 7 you saw 
that integration is the reverse of differentiation. You saw that you can use 
it to solve problems in which you know the values taken by the rate of 
change of a continuously changing quantity, and you want to work out the 
values taken by the quantity, or how much the quantity changes over some 
period. For example, you saw that if you have a formula for the velocity of 
an object in terms of time, then you can use integration to find a formula 
for the displacement of the object in terms of time, or work out the change 
in the displacement of the object over some period of time. 


In Sections 1 and 2 of this unit, you’ll meet a different way to think about 
integration. This way of thinking about it involves areas of regions of the 
plane, rather than rates of change. You'll see that the link between the two 
ways of thinking about integration helps you to solve problems both about 
areas and about rates of change. 


In Sections 3 to 5 of the unit, you’ll learn some more methods for 
integrating functions, which you can use in addition to the methods that 
you learned in Unit 7. These further methods will allow you to integrate a 
much wider variety of functions than you’ve learned to integrate so far. 


Many students find some of the material in this unit quite challenging 
when they first meet it, so don’t worry if that’s the case for you too. As 
with most mathematics, it will seem more straightforward once you’re 
more familiar with it. If you find a particular technique difficult, then 
make sure that you watch the associated tutorial clips, and use the 
practice quiz and exercise booklet for this unit for more practice. 


This unit is likely to take you more time to study than most of the other 
units, so the study planner allows extra time for it. 


Introduction 
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1 Areas, signed areas and definite 
integrals 


In this section you'll look at a type of mathematical problem that you 
might think has little connection with the mathematics of rates of change 
that you studied in Units 6 and 7. In Section 2 you'll see that in fact the 
two areas of mathematics are closely related. 


Throughout Sections 1 and 2, we'll work with continuous functions. 
Remember that a continuous function is a function whose graph has no 
breaks in it, over its whole domain. Informally, it’s a function whose graph 
you can draw without taking your pen off the paper (though you might 
need an infinitely large piece of paper, as the graph might be infinitely 
long!). 


1.1 Areas and signed areas 


Sometimes it’s useful to calculate the area of a flat shape with a curved 
boundary. For example, suppose that an architect is designing a large 
building shaped as shown in Figure 1(a). The curve of the roof is given by 
the graph of the function 


f(v) =9- $? (-18 < x < 18), 


where x and f(x) are measured in metres. This graph is shown in 
Figure 1(b). 


(a) 


Figure 1 (a) The shape of a building (b) the graph that gives the curve 
of the roof 


1 Areas, signed areas and definite integrals 


Suppose that the architect wants to calculate the area shaded in 
Figure 1(b), so that she can determine the amounts of materials needed to 
build the end wall of the building. 


Here’s a way of calculating an approximate value for this area, which you 
can make as accurate as you wish. You start by dividing the interval 
[—18, 18], which is the interval of x-values that corresponds to the curve, 
into a number of subintervals of equal width. 


For example, in Figure 2 the interval has been divided into twelve 
subintervals. The right endpoint of the first subinterval is equal to the left 
endpoint of the next subinterval, and so on. 


For each subinterval, you approximate the shape of the curve on that 
subinterval by a horizontal line segment whose height is the value of the 
function at the left endpoint of the subinterval, as shown in Figure 2. You 
calculate the areas of the rectangles between these line segments and the 
x-axis (simply by multiplying their heights by their widths in the usual 
way), and add up all these areas to give you an approximate value for the 
required area. The more subintervals you use, the better will be the 
approximation. 


= 
CO 
RY 


—18 


Figure 2 A collection of rectangles whose total area is approximately the 
area in Figure 1(b) 


In the next example this method is used, with four subintervals, to find an 
approximate value for the area discussed above. Notice that, in both the 
example and in Figure 2 above, the rectangle corresponding to the first 
subinterval has zero height and therefore zero area — this happens because 
the value of the function at the left endpoint of this subinterval is zero. 
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Example 1 Calculating an approximate value for an area 


Use the method described above, with four subintervals as shown 
below, to find an approximate value for the area between the graph of 
the function f(x) =9—- Wx" and the z-axis. 


= 
lo) 
Ry 


Solution 


We divide the interval [—18, 18] into four subintervals of equal width. 
The whole interval has width 36, so each subinterval has width 
36/4 = 9. 


The left endpoints of the four subintervals are 

—18, -18+9, -18+2x9, -18+3x9, 
that is 

—18, -9, 0, 9. 
So the heights of the rectangles are 

tS), a f (0), Go); 
that is, 

9— $ x (-18)?, 9-4 x (-9)*, 9-4x0, 9-9”, 
which evaluate to 

De 27 

O F a a 

So we obtain the following approximate value for the area: 


405 
(0 x 9) + (% x 9) + (9 x 9) + (Z x 9) = —— = 202.5. 


The calculation in Example 1 shows that a (rather crude) approximate 
value for the area of the cross-section of the roof discussed at the 
beginning of this subsection is 202.5 m?. 
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Note that the units for the area of a region on a graph are, as you’d 
expect, the units on the vertical axis times the units on the horizontal axis. 
For example, if the units on both axes are metres, then the units for area 
are square metres (m?). If a graph has no specific units on the axes, then 
the units for area are simply ‘square units’, and we usually omit them 
when we state an area. 


In the next activity you’re asked to find another approximation for the 
area of the cross-section of the roof discussed earlier, by using the 
subinterval method with six subintervals instead of four. 


Activity 1 Calculating an approximate value for an area 
Use the method described above, with six subintervals as shown below, to 


find an approximate value for the area between the graph of the function 
f(x) =9- wx" and the z-axis. 


y 
al 
_18 | ig = 


Table 1 shows the approximate values for the area between the graph of 
the function f(x) = 9 — 32” and the x-axis that were found in Example 1 
and in the solution to Activity 1. It also shows, to three decimal places, 
some further approximate values that were found in the same way, but 
using larger numbers of subintervals. The calculations were carried out 
using a computer. 


Table 1 Approximate values for the area in Figure 1(b) 


Number of subintervals 4 6 12 50 100 500 
Approximation obtained 202.5 210 214.5 215.914 215.978 215.999 


You can see that as the number of subintervals gets larger and larger, the 
approximation obtained seems to be getting closer and closer to a 
particular number. We say that this number is the limit of the 
approximations as the number of subintervals tends to infinity. It’s the 
exact value of the area between the graph of the function f(x) = 9 — qu" 
and the x-axis. From Table 1, it looks as if the exact value of this area is 
216, or perhaps a number very close to 216. 
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In general, if you have a continuous function f whose graph lies on or 
above the z-axis throughout an interval [a,b], then you can use the method 
demonstrated above to find, as accurately as you want, the area between 
the graph of f and the z-axis, from x = a to x = b. Figure 3(a) illustrates 
an area of this type, and Figure 3(b) illustrates the approximation to the 
area that’s obtained by using 8 subintervals. 


RY 
Xy 


(b) 


Figure 3 (a) The area between a graph and the z-axis, from z = a to 
x = b (b) A collection of rectangles whose total area is approximately 
this area 


Here’s a summary of the method that you’ve met for finding an 
approximate value for an area of this type. You start by dividing the 
interval [a,b] into a number, say n, of subintervals of equal width. The 
width of each subinterval is then (b — a)/n. For each subinterval you 
calculate the product 


f ( left endpoint ) . b — A (1) 


of subinterval n 


and you add up all these products. In general, the larger the number n of 
subintervals, the closer your answer will be to the area between the graph 
of f and the z-axis, from « = a to x = b. 


Now suppose that you have a continuous function f whose graph lies on or 
below the x-axis throughout an interval [a,b], and you want to calculate 
the area between the graph of f and the z-axis, from z = a to x = b, as 
illustrated in Figure 4(a). 


You can use the method above, with a small adjustment at the end, to 
calculate an approximate value for an area like this. Consider what 
happens when you apply the method to a graph like the one in 

Figure 4(a). You start by dividing the interval [a,b] into n subintervals of 
equal width, then you calculate all the products of form (1) and add them 
all up. For a graph like this, because the value of f at the left endpoint of 
each subinterval is negative (or possibly zero), each product of form (1) 
will also be negative (or zero). In fact, each product of form (1) will be the 
negative of the area between the line segment that approximates the curve 
and the z-axis, as illustrated in Figure 4(b). So when you add up all the 
products of form (1), you’ll obtain an approximate value for the negative of 
the area between the curve and the z-axis, from z = a to x = b. 


1 Areas, signed areas and definite integrals 


Sv 
xy 


(a) (b) 


Figure 4 (a) The area between another graph and the z-axis, from x = a 
to x = b (b) A collection of rectangles whose total area is approximately 
this area 


That’s not a problem, because you can simply remove the minus sign to 
obtain the approximate value for the area that you want. However, to help 
us deal with situations like this, it’s useful to make the following 
definitions. These definitions will be important throughout the unit. 


Consider any region on a graph that lies either entirely above or entirely 
below the x-axis. The signed area of the region is its area with a plus or 
minus sign according to whether it lies above or below the x-axis, 
respectively. For example, in Figure 5 the two shaded regions above the 
z-axis have signed areas +4 and +6, respectively, which you can write 
simply as 4 and 6, and the shaded region below the x-axis has signed 
area —3. 


| 
bo - 
| 
Ne} 
i a 
iw 
Oa 
Ry 


Figure 5 Regions on a graph 


If you have a collection of regions on a graph, where each region in the 
collection lies either entirely above or entirely below the x-axis, then the 
total signed area of the collection is the sum of the signed areas of the 
individual regions. For example, the total signed area of the collection of 
three regions in Figure 5 is 4+ (—3) +6 = 7. 


The units for signed area on a graph are, as you’d expect, the same as the 
units for area on the graph. That is, they’re the units on the vertical axis 
times the units on the horizontal axis. If there are no specific units on the 
axes, then we don’t use any specific units for signed area. 
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Example 2 Understanding signed areas 
The areas of some regions on a graph are marked below. 


ta {| WELCOME | In each of parts (a)—(c), use these areas to find the signed area 
| Leafy Hil ¢ | between the graph and the z-axis, from the first value of x to the 
second value of zx. 


(a) From z = —3 to x = -2. (D) From a= =2 toa = I 


(ec) From a= "—=3) to ¢ = 1. 


Negative signed area Solution 
(a) The signed area from x = —3 to x = —2 is —2.2. 
(b) The signed area from x = —2 tox = 1 is +4.8 = 4.8. 


(c) The signed area from x = —3 to x = 1 is —2.2 + 4.8 = 2.6. 


Of course, the signed area value found in Example 2(c) doesn’t correspond 
to any actual area on the graph, as it’s the sum of a positive signed area 
and a negative signed area. 
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Activity 2 Understanding signed areas 


Consider again the graph in Example 2. In each of parts (a)—(d) below, use 
the given areas to find the signed area between the graph and the x-axis, 
from the first value of x to the second value of x. 


(a) From z = 1 to r = 4. (b) From z = 5 to x =7. 
(c) From z = 1 tox =7. (d) From z = 1 to z =9. 


With the definition of signed area that you’ve now seen, the method that 
you’ve met in this subsection can be described concisely as follows. 


Strategy: 

To find an approximate value for the signed area between 
the graph of a continuous function f and the x-axis, from 
to toi — 0 


Divide the interval between a and b into n subintervals, each of 
width (b — a)/n. For each subinterval, calculate the product 


d 


; endpoint of subinterval : b—a 
nearest a n 


and add up all these products. 


In general, the larger the number n of subintervals, the closer your 
answer will be to the required signed area. 


Notice that the box above uses the phrase ‘endpoint of subinterval 

nearest a’, rather than ‘left endpoint of subinterval’, which means the same 
thing. This will be convenient later in this subsection. (Note that, in this 
phrase, it’s the endpoint of the subinterval, not the subinterval itself, 
that’s nearest a!) 


Figures 6 and 7 illustrate the strategy above. If you add up the signed 
areas of all the rectangles in Figure 6(a), then you'll obtain an 
approximate value for the signed area in Figure 6(b). 


| ae teal 


(a) (b) 


Figure 6 (a) A collection of rectangles whose total signed area is 
approximately the signed area shown in (b) 
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Similarly, if you add up the signed areas of all the rectangles in Figure 7(a), 
then you'll obtain an approximate value for the signed area in Figure 7(b). 


YA YA 


(a) (b) 


Figure 7 (a) Another instance of a collection of rectangles whose total 
signed area is approximately the signed area shown in (b) 


Activity 3 Calculating an approximate value for a signed area 


Use the method described in the box above, with six subintervals as shown 
on the left below, to find an approximate value for the signed area between 
the graph of the function f(x) = 3 — x? and the x-axis from x = —3 to 

x = 3, as shown on the right below. 


In the next activity you can use a computer to explore approximations to 
signed areas found by using subintervals in the way that you’ve seen. 
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(mere) 
Activity 4 Calculating more approximate values for signed areas E 


Open the applet Approximations for signed areas. Check that the function 
is set to f(x) = 3 — a”, the values of a and b are set to —3 and 3 
respectively, and the number of subintervals is set to 6. The resulting 
approximation for the signed area between the graph of f and the z-axis, 
from x = a to x = b, should then be the value found in the solution to 
Activity 3. 


Now increase the number of subintervals, and observe the effect on the 
approximation. What do you think is the exact value of the signed area? 


Experiment by changing the function, the values of a and b, and the 
number of subintervals, to find approximate values for some other signed 
areas. 


So far in this subsection, whenever a signed area from x = a to x = b has 
been discussed, the value of b has been greater than the value of a. 


The value of b can also be equal to the value of a, and this gives a signed 
area of zero, as illustrated in Figure 8. In cases like this, the method that 
you’ve seen for using subintervals to find approximate values for signed 
areas gives the answer zero, since the width (b — a)/n of each subinterval is 
zero. 


Ry 


Figure 8 A signed area equal to zero 


In fact, for reasons that you’ll see later in the unit, it’s useful to extend the 
definition of signed area a little, to give a meaning to the phrase ‘the 
signed area between the graph of f and the z-axis from «=a to x = b’ 
when b is less than a. It’s not immediately obvious what the phrase means 
in this case, but it turns out that the natural meaning is as follows. This 
definition will be important throughout the unit. 
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Suppose that f is a continuous function whose domain includes the 
interval [b, a], as illustrated in Figure 9. Then the signed area between 
the graph of f and the z-axis from x = a to x = b is defined to be the 
negative of the signed area between the graph of f and the x-axis from 
x=btoxr=a. 


Sv 


Figure 9 The graph of a continuous function f whose domain includes 
the interval |b, a] 


For example, consider the graph in Figure 10. The signed area from x = 1 
to x = 3 is 4, so the signed area from g = 3 to x = 1 is —4. Similarly, the 
signed area from x = 6 to x = 8 is —5, so the signed area from x = 8 

to x = 6 is —(—5) = 5. 


Figure 10 Areas on a graph 


Because of this extended definition of signed area, you now have to think a 
little more carefully when you read or use the phrase ‘signed area’ (and the 
signed area isn’t zero). If there’s no mention of ‘from x = a to x = b’, then 
to decide whether the signed area is positive or negative you just need to 
consider whether it lies above or below the z-axis. However, if the text is 
of the form ‘the signed area from x = a to x = b’, then to decide whether 
the signed area is positive or negative you also have to consider whether b 
is greater than or less than a. 


This is illustrated in the next example. A helpful way to think about 
whether b is greater than or less than a is to think about whether x moves 
forward or backward as it changes from x = a to x = b. 


1 Areas, signed areas and definite integrals 


Example 3 Understanding the extended definition of signed area 


The areas of some regions on a graph are marked below. In each of 
parts (a)—(d), use the given areas to find the signed area between the 
graph and the x-axis, from the first value of x to the second value 

OM gi- 


(a) From a = 2 to as— 3, (b) From o = 6 toa = 10: 
(e) Brom v= 10 w0 m= 6: (a0) Prom 7 = 0 to 2 = =i 


Solution 

(a) ®. From 3 to 3 is from a number to the same number. ® 
The signed area from gx = 3 to x = 3 is 0. 

(b) & From 6 to 10 is forward. @ 
The signed area from x = 6 to x = 10 is —5.4. 

(c) @ From 10 to 6 is backward. & 


The signed area from x = 6 to x = 10 is —5.4, 
so the signed area from x = 10 to x = 6 is 5.4. 


(d) & From 6 to —1 is backward. & 


The signed area from x = —1 to x = 6 is —2.4 + 4.0 = 1.6, 
so the signed area from x = 6 to x = —1 is —1.6. 


In each part of the next activity, start by thinking about whether x moves 
forward or backward (or stays fixed) as it moves from the first value of x 
to the second value of x. 
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Activity 5 Understanding the extended definition of signed area 


The graph in Example 3 is repeated below. In each of parts (a)—(g), use 
the given areas to find the signed area between the graph and the x-axis, 
from the first value of x to the second value of x. 


(a) From z = 2 to x = 6, (b) From z = 6 to ¢ =2. 

(c) From z = 2 to z = 10. (d) From z = 10 to x = 2. 
(e) From z = 8 to z = 8. (f) From z = 2 to x = —3. 
(8) 


Cc 


From z= 10 tox = —1. 


The subinterval method for finding approximate values for signed areas, 
which is summarised in the box on page 113, applies no matter whether 
the value of b is greater than, less than, or equal to the value of a. The 
reason why it works in cases where b is less than a is that in these cases 
the ‘width’ (b — a)/n of each subinterval is negative. 


You might like to use the applet Approximations for signed areas to see 
examples of the method giving approximate values for signed areas in cases 
where b is less than a. 


Of course, if you want to work out an area between a curve and the z-axis 
over an interval [a,b] by using the method that you’ve seen in this 
subsection, then normally you would work out the signed area from a to b, 
rather than the signed area from b to a. However, it’s important to 
understand what’s meant by the signed area from b to a, for reasons that 
you'll see later in the unit. 
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1.2 Definite integrals 


Before you learn more about why signed areas are important, it’s useful for 
you to learn some terminology and notation that are used when working 
with them. 


If f is a continuous function and a and b are numbers in its domain, then 
the signed area between the graph of f and the z-axis from x = a to x = b 
is called the definite integral of f from a to b, and is denoted by 


f to m 


This notation is read as ‘the integral from a to b of f of x, d x’. The 
numbers a and b are called the lower and upper limits of integration, 
respectively. 


For example, for the function f in Figure 11, 
7 9 
| f@ar=-9, f fad, 
3 7 


9 
and f f(z) dz = —9 + 2 = -7. 
3 


Similarly, for the same function, 


4 3 
[ ta@ar=o. and [ t@ar=-(-9) =8. 


Figure 11 The graph of a function, and some areas 


Youll see a little later in this subsection where this notation for a definite 
integral comes from. 


The symbol f is called the integral sign. It sometimes appears in a 


smaller form, f, when it’s in a line of typed text. 
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As with Leibniz notation for derivatives, the ‘d’ in the notation for definite 
integrals has no independent meaning. In many texts, including this one, 
it’s printed in upright type, rather than italic type, to emphasise that it’s 
not a variable. You don’t need to do anything special when you 

handwrite it. 


When you use the notation f? f(x) dx, remember that you must include 


not only the f? at the beginning, but also the dx at the end. Try not to 
forget the dz! 


You’ve now seen that a definite integral is a type of signed area, and you 
saw in Unit 7 that an indefinite integral is a type of general antiderivative. 
These two seemingly unrelated concepts are, as you probably suspect, 
closely related. Youv’ll see how in Section 2. 


The box below lists some standard properties of definite integrals, which 
come from their definition as signed areas. These properties hold for all 
values of a, b and c in the domain of the continuous function f. The 
second property comes from the extended definition of signed area, to 
cases where b is less than a. The third property expresses the fact that the 
signed area from a to c is equal to the signed area from a to b plus the 
signed area from b to c. Figure 12 illustrates this property in a case where 
a <b < c, but the extended definition of signed area ensures that the 
property holds even when a, b and c aren’t in this order. This is one reason 
why the definition of signed area is extended in the way that you’ve seen. 


Standard properties of definite integrals 


oa 


[ to- f sear 
[roa f toa f toa 


RY 


Figure 12 Two adjacent signed areas 


1 Areas, signed areas and definite integrals 


Activity 6 Understanding definite integrals 


Consider the function f whose graph is shown below. The areas of some 
regions are marked. By using these areas, write down the values of the 
following definite integrals. 


Hint: notice that in some of these definite integrals the upper limit of 
integration is less than the lower limit of integration. 


@ [sear w fioa [tear 
o f sear © f naar O f roa 


5 —3 2 
(e) / f(a)dx (h) f fajde G) f f(x) dx 
-3 


—5 —5 
0) / Hed Wf sew 0) f f(a) dr 


7 


Activity 7 Using a standard property of definite integrals 


Consider again the graph in Activity 6. 


9 9 
Given that f f(z) dz = —15, find | F(a) da. 
2 7 
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As you’d expect, you can replace the expression f(x) in the notation for a 
definite integral by the formula for a particular function. For example, the 
signed areas in Figure 13 are denoted by 


1 1 
f r?’dr and f (z? — 1) dz, 
-1 0 


respectively. 


(b) 


Figure 13 Signed areas between the graphs of particular functions and 
the x-axis 


As always with algebraic notation, the notation for a definite integral can 
be used with letters other than the standard ones. For example, if g is a 
continuous function whose domain contains the numbers p and q, and you 
use t to denote the input variable of g, then the definite integral of g 
from t = p to t = q is denoted by 


f e 


In fact, the input variable of the function in a definite integral is what’s 
known as a dummy variable — you can change its name to any other 
variable name that you like, without affecting the value of the definite 
integral. For example, if f is a continuous function whose domain contains 
the numbers a and b, then 


f tow f roas f toau 


and so on. As a particular example, 


1 ï 1 
/ x’ dx =] Par= | u? du, 
-1 =i -1 


since all these definite integrals denote the signed area shown in 

Figure 13(a). 

If f is any continuous function, and a and b are numbers in its domain, 
then you can find an approximate value for the definite integral p f(x) dz, 


as accurately as you like, by using the method that you met in the last 
subsection. Here’s the method expressed algebraically. 


1 Areas, signed areas and definite integrals 


Suppose that you want to find a f(x) da, as illustrated in Figure 14(a). 
You divide the interval between a and b into n subintervals, each of 
width (b — a)/n, as illustrated in Figure 14(b). We’ll denote (b — a)/n 
by w here, for conciseness. 


RY 


Xy 


a b 


a 
n subintervals 


(a) (b) 


Figure 14 (a) A definite integral f? f(x) dx (b) A collection of n 
rectangles whose total signed area is approximately this definite integral 


The endpoints nearest a of the subintervals are 


a+0w, atlw, a+2w, ..., at(n—1)w. 


(Remember that the strategy on page 113 specifies ‘endpoint of subinterval 
nearest a’, rather than ‘left endpoint of subinterval’. The reason for this is 
that it ensures that the algebraic expressions for the endpoints stated 
above are correct when b is less than a, as well as when b is greater than a 
or equal to a.) 


So an approximate value for the definite integral of f from x =a to x = b 
is given by 


f(a+0w) x w+ f(a+lw) x w+ f(a+2w) x w+- 
--+ fla + (n -— 1w) x w. 


This expression can be simplified slightly by taking out the common 
factor w. Doing this (and writing the common factor at the end) gives 


(f(a + Ow) + f(at+1lw)+ f(a+2w)+---+flat+(n- Lw)) w, 


As the value of n gets larger and larger, the value of this expression gets 
closer and closer to the value of fe Je) da. 


The limit of the value of the expression, as the value of n tends to infinity, 
is the exact value of ri f(x) dx. This idea is summarised below. The 
notation ‘ lim ’ means ‘the limit, as n tends to infinity, of’. 

Noo 
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Algebraic definition of a definite integral 


Suppose that f is a continuous function and a and 6 are numbers in 
its domain. Then the definite integral of f from z =a to x = b is 
given by the equation 


b 
l f(a) da = lim (f(a +0w) + f(a + 1w) + f(a + 2w) ++ 


+ + f(a + (n= 1)w))w 


where w = (b — a)/n. 


If f is a particular continuous function, and a and b are numbers in its 
domain, then you can sometimes calculate an exact value for T f(x) dx by 
using an algebraic method to find the limit in the box above. In some 
cases you might even be able to obtain a general formula for H f(z) de; in 
terms of a and b. However, this is usually a complicated and difficult 
process, so normally we don’t do it. Instead, if you want to evaluate a 


definite integral f? f(x) dx, then you have two main options. 


The first option is to obtain an approximate value by using subintervals in 
the way that you’ve seen. We usually use a computer to carry out the 
calculations. The computer algebra system that you’re using in this 
module includes a command that instructs the computer to find an 
approximate value for a definite integral, to a particular level of precision, 
by using a version of the subinterval method that you’ve seen. You’ll learn 
about this command in the final subsection of this unit. 


The other option for evaluating a definite integral I f(x) dx is to use an 
algebraic method that arises from the fundamental theorem of calculus, 
which you'll meet in Section 2. This algebraic method involves much less 
calculation than using subintervals, and it has the advantage of giving an 
exact answer. However, sometimes it’s not possible or not practicable to 
use this method, as you'll see. 


Now here’s an explanation of where the notation and terminology for 
definite integrals come from. They arise directly from the method that 
you've seen for calculating approximate values for definite integrals. To see 
how, consider applying the method to a function f, as illustrated in 
Figure 15(a). 


Consider any one of the subintervals. You could denote its endpoint 
nearest a by x, and its width by dz, as shown in Figure 15(b). As you saw 
in Unit 6, the notation ôx represents a small change in zx. 


1 Areas, signed areas and definite integrals 


Ry 


Figure 15 (a) The subinterval method applied to a function f from 
x =a to x = b (b) One of the subintervals and its corresponding rectangle 


With this notation, the signed area of the rectangle corresponding to the 
subinterval, from x to x + 62, is 


f(a) dx. (2) 


So the total signed area of all the rectangles is the sum of a number of 
terms, each of form (2). As the number of subintervals gets larger and 
larger, the size of dx gets smaller and smaller, and the total signed area of 
the rectangles gets closer and closer to the definite integral of f from a 

to b. So, loosely, you can think of this definite integral as the sum of 
infinitely many terms of form (2), where the quantity ôx is infinitely small. 
Historically, as you saw in Unit 6, the quantity dx was denoted by dx when 
it becomes infinitely small, so the sum described above was denoted by 


[see 


where the symbol f is an elongated ‘S’, which stands for ‘sum’. 


Johann Bernoulli (1667—1748) 
To see where the term ‘integral’ comes from, remember that the verb ‘to 
integrate’ means ‘to join together’. Loosely, you can think of a definite 
integral as being obtained by joining together infinitely many signed areas, 
each infinitely narrow, into a single signed area. 


The name and principal symbol for integral calculus were discussed 
by Gottfried Wilhelm Leibniz and the Swiss mathematician 

Johann Bernoulli. Leibniz preferred ‘calculus summatorius’ as the 
name, and f, the elongated S, as the symbol. Bernoulli preferred 
‘calculus integralis’ as the name, and the capital letter I as the 
symbol. Eventually they agreed to compromise, adopting Bernoulli’s 
name ‘integral calculus’ and Leibniz’s symbol f. The first appearance 
in print of the word ‘integral’ was in a work by Johann’s brother, 
Jacob Bernoulli, in 1690, though Johann insisted that he had been 
the one to introduce the term. 


Jacob Bernoulli (1654-1705) 
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Joseph Fourier (1768-1830) 
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The modern notation for a definite integral, with limits at the bottom 
and top of the integral sign, is due to the French mathematician 
Joseph Fourier, who introduced it in his pioneering book on heat 
diffusion Théorie analytique de la chaleur (The analytic theory of 
heat) of 1822. He first published the notation four years previously in 
an announcement for the book. Fourier accompanied 

Napoleon Bonaparte on his Egyptian expedition of 1798, and later 
supported Jean-Francois Champollion, the decipherer of the Rosetta 
Stone. He is remembered today for initiating the investigation of 
Fourier series. These are infinite trigonometric series, which are now 
named after him, and which arose in the context of his work on heat 
diffusion. You’ll meet the idea of a series in Unit 10. 


To finish this section, here’s a summary of the main idea that you’ve met. 


Definite integrals 


Suppose that f is a continuous function, and a and b are numbers in 
its domain. The signed area between the graph of f and the z-axis 
from x = a to x = b is called the definite integral of f from a to b, 
and is denoted by 


| ae 


2 The fundamental theorem of 
calculus 


In this section you'll discover how the ideas about signed areas that you 
met in Section 1 are connected with the ideas about rates of change that 
you met in Units 6 and 7. The two areas of mathematics are linked by an 
important fact known as the fundamental theorem of calculus. 


This theorem is stated in the first subsection of this section, preceded by 
an explanation of why it’s true. It’s a good idea to read the explanation, 
to increase your understanding, particularly if you intend to study more 
mathematics after this module. However, if you find the explanation hard 
to understand, then you can skip it, and go straight to the box headed 
‘Fundamental theorem of calculus’, near the end of Subsection 2.1. Read 
the contents of this box, and continue reading to the end of the subsection. 
Make sure that you understand the fundamental theorem of calculus, and 
the other important facts stated there, even if you don’t know why they’re 


2 The fundamental theorem of calculus 


true. The two subsections that follow, Subsections 2.2 and 2.3, show you 
how to use the fundamental theorem of calculus. You might like to try 
reading Subsection 2.1 again later, as some of the ideas that you found 
difficult the first time might seem easier once you’re more familiar with the 
ways in which you can use the fundamental theorem of calculus. 


2.1 What is the fundamental theorem of calculus? 


The first step in obtaining a link between signed areas and rates of change 
is to define, for any continuous function f, a new function whose values are 
signed areas between the graph of f and the z-axis. Here’s how we do that. 


Consider any continuous function f, as illustrated in Figure 16. 


y = f(z) 


Sv 


Figure 16 The graph of a continuous function f 


Let’s choose any number, say s, in the domain of f, and define a new 
function, A, to have the same domain as f and the following rule: 


A(x) = signed area between the graph of f 
and the x-axis, from s to a. 


This definition is illustrated in Figure 17. 


A(x) = signed 
y area from s to x 


Figure 17 A continuous function f and a signed area A(x) 


We call the function A the signed-area-so-far function for the 
function f, with starting point s. Its rule can be expressed concisely as 
follows: 


A(x) = I HA) dt. 
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(Here the variable t has been used in the definite integral instead of the 
usual variable x, to avoid confusion with the input variable x of the 
function A.) 


Note that if a and b are any two numbers in the domain of f, then the 
signed area between the graph of f and the z-axis, from z = a to x = b, is 
given by 


A(b) — A(a). 


In other words, 


b 
il iois Ale. 


This fact is illustrated in Figure 18, in a case where s < a < b. The third 
property of definite integrals in the box on page 120 tells you that the fact 
holds even when s, a and b aren’t in this order. 


Figure 18 An illustration of the fact that i f(x) dx = A(b) — A(a) 


Now let’s consider how the value of a signed-area-so-far function changes 
as the value of the input variable x changes. Let’s start by looking at what 
happens when the function f is a constant function. 


2 The fundamental theorem of calculus 


For example, consider the constant function f(x) = 3, whose graph is 
shown in Figure 19. Let’s take the starting point s for the 
signed-area-so-far function A to be 1, as indicated in Figure 19. 


Sissi het 


Figure 19 The graph of the constant function f(x) = 3 


Figure 20 shows some values of A(x). 


ee 


= N we 
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Figure 20 Some values of A(x) when f is the constant function f(x) = 3 
and the starting point s is 1 


You can see that the signed area A(x) changes at the rate of 3 square units 
for every unit by which x changes. For example, 


if x increases by 1, then A(x) increases by 3 
if x increases by 0.5, then A(x) increases by 0.5 x 3 = 1.5 
if x increases by —1, then A(x) increases by —1 x 3 = —3. 


(Remember that a negative increase is a decrease.) 


In other words, the rate of change of A(x) with respect to z is 3. 
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More generally, you can see that for any constant function f(x) = k, and 
any signed-area-so-far function A for f, the value of A(x) changes at the 
rate of k square units for every unit by which x changes. In other words, 
the rate of change of A(z) is k. 


There are two more examples of this in Figure 21. In Figure 21(a), the 
function is f(x) = 1.5, so the signed area A(x) changes at the rate of 
1.5 square units for each unit by which x changes. In Figure 21(b), the 
function is f(x) = —2, so the signed area A(x) changes at the rate of 
—2 square units for each unit by which æ changes. 


rate of change of rate of change of 
A(z) is 1.5 A(x) is —2 
YA 
2- 
1A 
T T T T mal x 
=A= ed 
ENS Ey P 


Figure 21 Signed-area-so-far functions for constant functions, with 
starting points s = —1 in (a) and s = 3 in (b) 

Now let’s look at what happens for a function f that isn’t a constant 
function. Consider, for example, the function f whose graph is shown in 
Figure 22. The starting point s for the signed-area-so-far function A has 
been chosen to be 2. 


Figure 22 A signed-area-so-far function for a function f that isn’t a 
constant function 


For this function, as the value of x changes, the value of A(x) doesn’t 
change at a constant rate. For example, as illustrated in Figure 23(a), 
when z is close to 5 the value of A(x) changes at the rate of about 3 square 
units for every unit by which x increases. Similarly, as illustrated in 
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Figure 23(b), when z is close to 7 the value of A(x) changes at the rate of 
about 4 square units for every unit by which x increases. 


Figure 23 (a) When z is close to 5, the rate of change of A(x) is about 3 
(b) When z is close to 7, the rate of change of A(x) is about 4 


More precisely, when x = 5 the instantaneous rate of change of A(x) is 3, 
and when x = 7 the instantaneous rate of change of A(x) is 4. In fact — 
and this is the crucial thing to realise — at any value of x, the 
instantaneous rate of change of A(x) is f(x), as illustrated in Figure 24. 


Figure 24 The instantaneous rate of change of A(x) at x is f(x) 


In other words, the derivative of the function A is the function f, and so 
the function A is an antiderivative of the function f. 


You can see that this will be true for any continuous function f and any 
signed-area-so-far function A for f. That is, we have the important fact 
below. Because of its importance, it deserves to be called a theorem. 


Theorem 


Suppose that f is a continuous function, and s is any number in its 
domain. Let A be the signed-area-so-far function for f with starting 
point s. Then A is an antiderivative of f. 


It’s this fact that leads to the fundamental theorem of calculus. Consider 
any continuous function f. Let s be any number in its domain, and let A 
be the signed-area-so-far function for f with starting point s. 
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Suppose that a and b are any numbers in the domain of f. Then, as you 
saw earlier in this subsection, the value of f? f(x) da, the signed area 
between the graph of f and the z-axis from x = a to x = b, is given by 


b 
| Kæde = AQ) - Ala). (3) 


Now let F be any antiderivative of f. Then, since A and F are both 
antiderivatives of f, the value of F(b) — F(a) is the same as the value of 
A(b) — A(a). That’s because, as you saw in Unit 7, the value of 

F(b) — F(a) is the same no matter which antiderivative of f you take F 
to be. 


So you can replace A(b) — A(a) by F(b) — F(a) in equation (3) above. 
This gives the crucial fact that’s known as the fundamental theorem of 
calculus. It’s stated concisely below. 


Fundamental theorem of calculus 


Suppose that f is a continuous function whose domain contains the 
numbers a and b, and that F is an antiderivative of f. Then 


b 
i Far an enElay: 


This theorem can be stated in words as follows. 


Fundamental theorem of calculus (in words) 


Suppose that f is a continuous function whose domain contains the 
numbers a and b, and that F is an antiderivative of f. Then the 
signed area between the graph of f and the z-axis from x = a to 

x = b is equal to the change in the value of F as x changes from x = a 
iO a = D. 


The equation in the fundamental theorem of calculus is true no matter 
whether the value a is less than, equal to or greater than the value b. This 
is the main reason why the definition of signed area is extended in the way 
that you saw earlier in this unit. 


Here’s another important fact that’s come out of the discussion in this 
subsection. The theorem in the box on page 131 tells you immediately that 
the following fact is true. 


Every continuous function has an antiderivative. 


2 The fundamental theorem of calculus 


In fact there’s some disagreement among mathematicians about exactly 
which theorem should be called ‘the fundamental theorem of calculus’. 
Everyone agrees that this name should be given to a theorem that links 
signed areas with rates of change. However, while some mathematicians 
use the name for the theorem that’s given this name in this module, other 
mathematicians use it for the theorem in the box on page 131. Still other 
mathematicians take the view that these two theorems are two 
fundamental theorems of calculus, or two parts of a single fundamental 
theorem of calculus. So if you search for ‘fundamental theorem of calculus’ 
on the internet, or look it up in books, then you’re likely to find a variety 
of different results. 


2.2 Using the fundamental theorem to find signed 
areas 

The fundamental theorem of calculus gives you a quick way to find the 

signed area between the graph of a continuous function f and the x-axis, 

from « =a to x = b, in cases where you know a formula for an 


antiderivative F of f. The theorem tells you that this signed area, which is 
the definite integral 


[ soe. 


is given simply by F(b) — F(a). 


For example, suppose that you want to find the area between the graph of 


the function f(x) = cosx and the z-axis, from x = 0 to x = 77/2, as YA 
illustrated in Figure 25. This area is the definite integral il 
T/2 
i cos x da. B 
0 1 T £ 
Since an antiderivative of f(x) = cosx is F(x) = sin x (as you saw in 2 
Unit 7), the fundamental theorem of calculus tells you that =i 
T/2 7 
f cosr de = sin (Z) -sn0=1-0=1. 
j es . ; Figure 25 The area between 
So the area in Figure 25 is 1 square unit. the graph of f(x) = cosx and 
the x-axis, from x = 0 to 


You may find it magical that a complicated area like the one in Figure 25 
can be found from such a simple calculation. In particular, it may seem 
amazing that although the area in Figure 25 depends on infinitely many 
values of cos x (namely all the values of cos x for x = 0 to x = 7/2), it can 
be calculated from the values of an antiderivative of cosx at just two 
values of x, namely x = 0 and x = 7/2. 


e= a2 


Notice that in the calculation above it doesn’t matter which of the 
infinitely many antiderivatives of f(x) = cosx you use, because they all 
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differ by a constant and hence they all lead to the same final answer. For 
example, using the antiderivative F(x) = sing +7 gives 


T/2 
f cosnda = (sin (Z) +7) - (sin0+7)=1+7-0-7=1. 
0 
It’s usually best to use the simplest antiderivative in calculations of this 
kind. 


You'll see another example of the fundamental theorem of calculus used to 
find a signed area shortly, and you’ll then have the opportunity to find 
some signed areas in this way yourself. First, however, it’s helpful for you 
to learn a type of notation that’s convenient in calculations of this kind. 


For any function F’, the expression 
F(b) — F(a) 


can be denoted by 
b 


[Fo] 


For example, 


[sin z]7? = sin (5) —sin0=1—-0O=1. 


You can practise evaluating expressions of this type in the next activity. 


Activity 8 Evaluating expressions of the form [F(a)]|° 


Evaluate the following expressions. In part (c), give your answer to three 
significant figures. 


(a) [da]; (b) [cos æ] =" (c) a 


With the square bracket notation introduced above, the fundamental 
theorem of calculus can be restated as follows. 


Fundamental theorem of calculus (square bracket notation) 


Suppose that f is a continuous function whose domain contains the 
numbers a and b, and F is an antiderivative of f. Then 


3 b 
f Pede = HOM 


In the next example, the fundamental theorem of calculus is used, together 
with the square bracket notation, to find a signed area — that is, to 
evaluate a definite integral. 
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You’ve now seen that, when you want to find a signed area, it’s much 
quicker and simpler to use the fundamental theorem of calculus than to 
use the method involving subintervals that you saw in Subsection 1.1. 
Another advantage of using the fundamental theorem of calculus is that it 
gives you an exact answer, rather than an approximate one. However, 
remember that you can use the fundamental theorem of calculus only when 
you can find a formula for the antiderivative that you need. Sometimes it’s 
difficult or impossible to find such a formula. 


In the next activity, you can try evaluating some definite integrals by using 
the fundamental theorem of calculus. Use the square bracket notation, as 
in Example 4, since you need to get used to this notation. You can find the 
antiderivatives that you need by using the table of standard indefinite 
integrals given near the end of Unit 7. This table is also included in the 
Handbook. 


Figure 26 The definite 
integral in Example 4 
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Activity 9 Evaluating definite integrals 
Evaluate the following definite integrals (which are shown in Figure 27). 


Give exact answers, and (except in parts (a) and (d), where the answer is a 
simple fraction or an integer) also give answers to three significant figures. 


(a) [ea (b) [eu (c) [iw 


7/4 > 1 1 
(d) ta sec” 0 d0 (e) l: Taw 


Figure 27 The definite integrals in Activity 9 


The next example demonstrates how to evaluate another definite integral 
by using the fundamental theorem of calculus. The only difference from 
the examples that you’ve seen so far is that the antiderivative that’s 
needed for the method can’t be found directly from the table of standard 
indefinite integrals. Instead, we have to start by manipulating the given 
function to express it as a sum of constant multiples of functions that we 
can integrate, then use the constant multiple rule and sum rule for 
antiderivatives, together with the table of standard indefinite integrals. 
You practised finding antiderivatives in this way in Unit 7. 


As shown in the example, a convenient way to carry out these sorts of 
manipulations is to manipulate the expression inside the notation 


[a ... dx. This expression is known as the integrand. For example, the 


definite integral i, x? dx has integrand x”. 


2 The fundamental theorem of calculus 


Example 5 Evaluating a more complicated definite integral CS) 
Evaluate the definite integral 
i (2x + 1)(a — 4) ae 
2 T 
(It is shown in Figure 28.) 
Solution 


@®. Manipulate the integrand to get it into a form that you can 
integrate. ® 


T (2x + 1) (x — 4) T 
2 


Il 

e 
A 

AN 

N 

8 

l 

as 8 

l 

| 
ae 

(en 

8 

| 
poe 
=a e 
wa 
ips 
ol 

RY 


x 
E 1 
e js. 
2 x A 


@. Apply the fundamental theorem of calculus, and evaluate the 


result. ® 
= ID Gao 4 Figure 28 The definite 
=p ig e ap integral in Example 5 
= pa = p= dina]; 


2 

=(P = eA in (02 AyD) 
= 16 — 98 4nd =A in 

= —2 — 4(In4 — In2) 

—~2— 41n(4/2) 

= —2 — 4ln 2 


@. There’s no need to find a decimal approximation, as the question 
didn’t ask for one. © 


There are other acceptable ways to leave the final answer in Example 5. 
For example, you could write it as —2(1 + 21n 2). 
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Activity 10 Evaluating more complicated definite integrals 


Evaluate the following definite integrals (which are shown in Figure 29). 
Give exact answers. 


(a) [veer (b) [ 


44d 
r 


dr 


dirais w f 


—1 


(a) 


Figure 29 The definite integrals in Activity 10 


When you use the square bracket notation, you can sometimes simplify 
your working by using the following rules. 


Constant multiple rule and sum rule for the square bracket 
notation 


[KF («)|? = k[F(2)]? , where k is a constant 
= Fe) +(e, 


For example, the constant multiple rule for the square bracket notation 
tells you that 


[5 sin z|? = 5[sin z]? 


and the sum rule for the square bracket notation tells you that 
[sin x + cos a = [sin z] + [cos x] E 
To see why these rules hold, you just need to use the definition of the 
square bracket notation. For example, 
[5sinz]? = 5sin 2 — 5sin 1 
= 5(sin 2 — sin 1) 


= 5[sin a]. 
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Similarly, 


[sin z + cos z|? = (sin 2 + cos 2) — (sin 1 + cos 1) 
= (sin 2 — sin 1) + (cos 2 — cos 1) 


= [sin z|? + [cos z|’. 


As always when a constant multiple rule and a sum rule hold, the sum rule 
for the square bracket notation also holds if you replace the plus sign on 
each side with a minus sign. To prove this, you combine the sum rule with 
the case k = —1 of the constant multiple rule. You saw this done for the 
constant multiple and sum rules for derivatives in Unit 6, Subsection 2.3. 


The next example illustrates how you can use the constant multiple rule 
and the sum rule for the square bracket notation when you’re evaluating a 
definite integral. 


= =} 
Example 6 Using the constant multiple rule and sum rule for the o) 
square bracket notation _ 


Evaluate the definite integral 


i + 2°) da. 


(It is shown in Figure 30.) 
Solution 


®. Use the fundamental theorem of calculus. ©& 
2 
2 
i (1+a°)dx = [e + 42°]; 
@. Use the sum rule. @ 


= [e]; + [zs] 


Figure 30 The definite 


integral in E le 6 
@. Use the constant multiple rule, and simplify the result. ® E R 


= (2—1) + }(24-1*) 
=14+4x15 


== 
= 7. 


You can practise using the constant multiple rule and sum rule for the 
square bracket notation in the next activity. 
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Activity 11 Using the constant multiple rule and sum rule for the 
square bracket notation 


Evaluate the following definite integrals. (They’re shown in Figure 31.) 
Give your answer to part (a) to three significant figures. 
1 


(a) [ora o f -adui 


1/2 


yY E2 YA 
20 ATs y = u — 2u? 
15 i > 
10 7. 


5 4 
2476 / 


Figure 31 The definite integrals in Activity 11 


Note that when you’re evaluating a definite integral it’s sometimes more 
convenient not to use the constant multiple rule and/or the sum rule for 
the square bracket notation, even if there’s an opportunity to use them. 
You can choose whether or not to use them, as seems convenient. 


A constant multiple rule and a sum rule also hold for definite integrals, as 
follows. 


Constant multiple rule and sum rule for definite integrals 


b b 
ii k i æde = bf f(x) dz, where k is a constant 
b 


| ore tae f ro [ sae 


These rules follow from the fundamental theorem of calculus and the 
constant multiple rule and sum rule for antiderivatives, which you met 

in Unit 7. To see this for the constant multiple rule, suppose that f is a 
continuous function whose domain contains the numbers a and b, and 

that k is a constant. Let F be an antiderivative of f. Then, by the constant 
multiple rule for antiderivatives, kF (x) is an antiderivative of kf (x). 
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Hence, by the fundamental theorem of calculus and the constant multiple 
rule for the square bracket notation, 


You can prove the sum rule for definite integrals in a similar way. 


Remember that the sum rule for definite integrals also holds if you replace 
the plus sign on each side with a minus sign, because, as mentioned earlier, 
this is always the case when a constant multiple rule and a sum rule hold. 


The constant multiple rule for definite integrals tells you that, for example, 
the area in Figure 32(b) is twice the area in Figure 32(a). 


YA YA y=2sing 
2- 
Y = SiNT 
1-4 
ia. 
2 


Figure 32 The area in (b) is twice the area in (a) 


Similarly, the sum rule for definite integrals tells you that, for example, the 
area in Figure 33(c) is the sum of the areas in Figure 33(a) and (b). 


Yy=r? +T 


Figure 33 The area in (c) is the sum of the areas in (a) and (b) 


The next example illustrates how you can use the constant multiple rule 


and the sum rule for definite integrals when you’re evaluating a definite 
integral. 
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y=x+t2sing 


Figure 34 The definite 
integral in Example 7 
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Example 7 Using the constant multiple rule and sum rule for 
definite integrals 


Evaluate the definite integral 


ih (x + 2sin x) da. 
0 


(It is shown in Figure 34.) 
Solution 


@. Use the fundamental theorem of calculus. Use the constant 
multiple rule and sum rule, for definite integrals and for the square 
bracket notation, as convenient. © 


f (+2sina)ar= f cde + f 2sin x dx 
0 0 0 


T T 
= zaz+2 f sinz dx 
0 0 


ie T 

T |, = esa] 

IIT T 

zlo 0 

w — 07) — 2(cos a — cos 0) 


NIP Nie NIP ne eo 
= wile 


— 2[cos z] 


= 


2 
A c Al 


3 4 


You can practise using the constant multiple rule and sum rule for definite 
integrals in the next activity. 


Activity 12 Using the constant multiple rule and sum rule for 
definite integrals 


Evaluate the following definite integrals. (They are shown in Figure 35.) 
Give exact answers. 


VT T/4 
(a) i ig’ dx (b) f (sin z + cos x) dx 
V3 = 


m/4 
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y y =sinx +cosx 


Figure 35 The definite integrals in Activity 12 


As with the sum rule and constant multiple rule for the square bracket 
notation, you can choose whether or not to use the sum rule and the 
constant multiple rule for definite integrals in your working, as seems 
convenient. Often it’s more convenient not to use them, even if there’s an 
opportunity to do so. 


In the next activity, you’re asked to use the fundamental theorem of 
calculus to find the exact area of the cross-section of a roof that was 
discussed at the beginning of Subsection 1.1. 


Activity 13 Using the fundamental theorem for a practical problem 


The curve of the roof of a building is given by the graph of the function 
f(x) =9- Ar’, 

where x and f(x) are measured in metres. Use the ideas that you’ve met 

in this section to find the area between this graph and the z-axis from 


x = —18 to x = 18, as shown below. Hence state the area of the 
cross-section of the roof. 


You may remember that in Subsection 1.1 we used subintervals to work 
out that the area in Activity 13 is approximately 216 m?. The solution to 
Activity 13 confirms that the area is eractly 216 m?. 


Here’s another practical example for you to try. In this example the graph 
lies below the x-axis, so the signed area that you’ll obtain by using the 
fundamental theorem of calculus will be negative. You have to remove the 
minus sign to obtain the required area. 
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Activity 14 Solving another practical problem about area 


An engineer is planning the construction of an open channel with a 
parabolic cross-section, as illustrated below. The shape of the cross-section 
is given by the equation 


y= 2" — 200 (-100 <s < 100), 


where x and y are measured in centimetres, as shown in the graph below. 


The maximum height of liquid permissible in the channel is the height of 
the z-axis on the graph; that is, 200 cm above the lowest point of the 
channel. To ensure that the channel is adequate for the expected flow of 
liquid, the engineer needs to know the area shaded on the graph. 


(a) Find the z-intercepts of the graph. 


(b) Hence calculate the shaded area, to the nearest square centimetre. 
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The history of the fundamental theorem of calculus 


You’ve seen that the fundamental theorem of calculus relates rates of 
change — that is, gradients of tangents — to signed areas. 


One of the earliest mathematicians to relate the problem of 
calculating tangents to the problem of calculating areas was 

Isaac Barrow, the first Lucasian professor at the University of 
Cambridge. However, his method was presented in the geometrical 
style of the day and was not computationally useful for solving 
problems. Isaac Newton was one of Barrow’s students, and in 1669 
Barrow resigned his professorship so that Newton could take it up. 


Newton based his definition of quadrature (determining area) on the 
idea that it is the converse of finding tangents. So what for some 
people was, and for modern-day mathematicians is, a theorem that 
finding areas and tangents are inverse processes — the fundamental 
theorem of calculus — was taken by Newton as the very definition of 
quadrature. Newton’s way of regarding area questions as the opposite 
of tangency questions is set down clearly in his De analysi of 1669. 


Gottfried Wilhelm Leibniz published his first accounts of differential 
and integral calculus in the newly founded journal Acta Eruditorum 
Lipsiensium (Acts of the scholars of Leipzig) in 1684 and 1686 
respectively, and his proof of the fundamental theorem of calculus 
followed in 1693. 


The first rigorous proof of the fundamental theorem of calculus was 
provided in the early decades of the 19th century by the French 
mathematician Augustin Louis Cauchy, who included it in his course 
at the Ecole Polytechnique in Paris. Cauchy was one of the most 
productive and influential mathematicians of his generation. His 
reformulation of the foundations of calculus, while very thorough, was 
also very hard to understand — so much so that not only his students 
but also his fellow professors protested! 


Augustin Louis Cauchy 
(1789-1857) 
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2.3 Using the fundamental theorem to find 
changes in the values of antiderivatives 


The fundamental theorem of calculus isn’t useful just for solving problems 
related to finding areas. It’s also useful for solving problems involving rates 
of change. 


As you saw in Subsection 5.3 of Unit 7, in integral calculus you often want 
to solve problems of the following type. You have a continuous function f, 
and two numbers a and b in its domain, and you want to work out the 
amount by which an antiderivative of f changes from x = a to x = b. 
(Remember that all the antiderivatives of f change by the same amount.) 


If you can find a formula for an antiderivative F of f, then it’s 
straightforward to do this: you just calculate F(b) — F(a). You saw 
examples like this in Unit 7. 


If you can’t find a formula for an antiderivative, then you can use the 
fundamental theorem of calculus to help you find an approximate answer. 
The theorem tells you that, for any antiderivative F of f, 


b 
F(b) — F(a) ad Jayde. 


So the answer that you want is equal to the signed area between the graph 
of f and the z-axis, from x = a to x = b. You can use the method that you 
met in Subsection 1.1 to find an approximate value for this signed area. 
You would normally do that by using a computer, as mentioned earlier. 


For example, suppose that you have a velocity-time graph for an object 
moving along a straight line, such as the graph in Example 8 below. Since 
displacement is an antiderivative of velocity, the fundamental theorem of 
calculus tells you that the change in the displacement of the object, from 
any point in time to any other point in time, is equal to the signed area 
between the graph and the time axis, from the first point in time to the 
second point in time. This fact is used in Example 8. 


As with any graph, the units for signed area on a velocity-time graph are 
the units on the vertical axis times the units on the horizontal axis. So, for 
example, the units for signed area on the graph in Example 8 are 

ms ixs= m; 


that is, metres. These are units for displacement, as you’d expect. 


Example 8 Using the fundamental theorem of calculus for a 
velocity-time graph 

The graph below is the velocity—time graph of an object moving along 
a straight line. The areas (in m) of some regions are marked on the 
graph. 


2 The fundamental theorem of calculus 


In each of parts (a) to (c), use the given areas to find the amount by 
which the displacement of the object changes from the first time to 
the second time. 


(a) From v = 0407 = 4 (b) Froma = Aiton = 0] 
(e) From c = 0 to ¢ = 10: 


IIS A 

I, 34 
a] 

2 
i Ir time, t (seconds) 
3 sirens 
aa 6 7 8/9 10 
> 

=). 


Solution 
(a) The change in displacement from t = 0 to t = 4 is 9m. 
(b) The change in displacement from t = 4 to t = 10 is —7m. 


(c) The change in displacement from t = 0 to t = 10 is 
9m+ (-7)m=2m. 


Notice that the velocity of the object in Example 8 changes from positive 
to negative at time t = 4. That is, at this time the object changes from 
moving in the positive direction to moving in the negative direction, as 
illustrated in Figure 36. The negative displacement that occurs after this 
time cancels out most of the positive displacement that occurred initially. 
Since the object was displaced by 9m in the positive direction, and then 
by 7m in the negative direction, its final displacement from its starting 
position is 2m, as found in Example 8(c). The object travels a total 
distance of 9m + 7m = 16m during the 10 seconds of its motion. 


—o— qe Fi 
SO Gl e a a aa T a 
initial position position 
position after 10 after 4 
seconds seconds 


Figure 36 The object in Example 8 starts moving in the positive 
direction, then turns and moves in the negative direction 
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Activity 15 Using the fundamental theorem of calculus for a 
velocity—time graph 


The velocity—time graph of an object moving along a straight line is shown 
below. The areas of some regions are marked on the graph. Use these 
areas to find the answers in parts (a) to (c) below. 


time, t (seconds) 


= B42 H 3 6 


velocity, v (cm s71) 


(a) In each of parts (i) to (iv), find the amount by which the displacement 
of the object changes from the first time to the second time. 


(i) From t= 0 tot=1. (ii) From t = 3 to t = 4. 
(ii) From t = 0 to t = 3. (iv) From t = 3 to t = 6. 


(b) What is the displacement of the object from its starting position at 
each of the following times? 


(i) t=2 (ii) t=6 


(c) What is the total distance travelled by the object during the 6 seconds 
of its motion? 


If you want to solve a problem that involves finding a change in the value 
of an antiderivative, and you can find a formula for an antiderivative, then 
it’s usually convenient to use the notation for a definite integral, and the 
square bracket notation, in your working. Here’s Example 22 from 
Subsection 5.3 of Unit 7 again, but with the working carried out using this 
notation. 
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Figure 37 The graph of 
y = 52° (x > 0) 


In the next activity, you’re asked to repeat Activity 36 from Subsection 5.3 
of Unit 7, but using the notation for a definite integral. 


Activity 160 Finding a change in displacement 


Suppose that an object moves along a straight line, and its velocity v 
(in ms~') at time t (in seconds) is given by v = 3 — 3t (0 < t < 6) 
(as shown in Figure 38). 


(a) Write down a definite integral that gives the change in the 
displacement of the object from time t = 0 to time t = 1. Evaluate it, 
and hence state how far the object travels in that time. 


(b) Repeat part (a), but from time t = 4 to time t = 5. 


velocity, v (ms~') 


Try using the notation for a definite integral to solve the problem in the 
next activity. 


Figure 38 The graph of 


v=3-3t(0<t<6) 
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velocity, v (ms~*) 


TE 
time, t (seconds) 


Figure 39 The graph of 
v =28— Tt (0<t< 4) 


= = 
om oo 
fi pS 


velocity, v (ms~!) 


123 4 5 
time, t (seconds) 


Figure 40 The graph of 
v=3t (05225) 
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Activity 17 Finding another change in displacement 


The driver of a car travelling at 28ms~! along a straight road applies the 
brakes. It takes four seconds for the car to stop, and its decreasing 
velocity v (in ms~!) during that time is given by the equation 


v = 28 — Tt, 


where ¢ is the time in seconds since the brakes were applied (as shown in 
Figure 39). How far does the car travel during those four seconds? 


In this subsection you’ve seen examples of the following two useful facts. 


For the velocity-time graph of any object, and for any two points in 
time t = a and t = b in the time period covered by the graph, the 
following hold. 


e The change in the displacement of the object from t= a tot = b 
is the total signed area between the graph and the time axis from 
=G iO? = 


e The total distance travelled by the object from t = a to t = b is 
the total area between the graph and the time axis from t = a to 
t = b (this applies when a < b). 


Note that if a graph is a straight line, or consists of several line segments, 
then integration may not be the easiest way to work out an area or a 
signed area between the graph and the horizontal axis. For example, you 
can work out the area shaded in Figure 39 by using the formula for the 
area of a triangle, as shown at the end of the solution to Activity 17. Try 
using a similar geometric method in the next activity, which is a repeat of 
Example 23 in Unit 7. 


Activity 18 Finding yet another change in displacement 


Suppose that a car begins to move along a straight road, and its velocity v 
(in ms~') during the first five seconds of its journey is given by the 
equation 


v = 3t, 


where ¢ is the time in seconds since it began moving, as shown in 
Figure 40. How far does the car travel from the end of the fourth second to 
the end of the fifth second? 


2 The fundamental theorem of calculus 


2.4 Notation for indefinite integrals 


Until now, when we’ve been working with antiderivatives and indefinite 
integrals, we’ve been using a lower-case letter, such as f, to denote the 
function that we start with, and the corresponding upper-case letter, such 
as F, to denote any of its antiderivatives, or its indefinite integral. 


For example, some antiderivatives of the function f(a) = x? are 
F(a) = i2’, F(z) = g’ +7, and F(r)= ig’ — 2, 
and the indefinite integral of this function f is 
F(x) = r’ Fé: 


However, there’s a standard notation for indefinite integrals, which arises 
from the link between antiderivatives and definite integrals. For any 
function f that has an antiderivative, the indefinite integral of f(x) is 
denoted by 


f or 


(This notation is read as ‘the integral of f of x, d x’.) 


For example, 
[eu = iz? +c, 
and similarly 
[smear = — cosg + c. 


(You saw in Unit 7 that the indefinite integral of sin x is — cos x + c.) 


Note that this notation is used only for indefinite integrals — it’s never used 
for antiderivatives. So it’s important to remember the following fact. 


If an equation has the notation f ...dx on one side only, then there 
must be an arbitrary constant on the other side. 


For example, the equation 


Je dx = ir’ 


is incorrect. The correct equation is 
[eu = gu +c, 
as you saw above. 


We'll continue to use the capital letter notation for antiderivatives, when 
it’s convenient to do so. 
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The notation for indefinite integrals introduced above, and the similar 
notation for definite integrals, are together known as integral notation. 
We refer to either an indefinite integral or a definite integral as an 
integral. In the indefinite integral 


J Fed, 


the expression f(x) is called the integrand, just as in a definite integral. 


The next example illustrates how integral notation is used for indefinite 
integrals in practice. It’s similar to the examples and activities on finding 
indefinite integrals in the second half of Unit 7, except that it uses integral 
notation. 


2 The fundamental theorem of calculus 


You can practise using integral notation for indefinite integrals in the 
following activity. Don’t forget to add the ‘+c’ in each part. 


Activity 19 Finding indefinite integrals, using integral notation 


Find the following indefinite integrals. 


i) fea (b) fete- 2a (c) [au 


(d) fino- cosan (e) [re (£) [e 
po 
(g) [aa (h) [Ha 


Since an indefinite integral is just an antiderivative with an arbitrary 
constant added on, the constant multiple rule and sum rule for 
antiderivatives, from Unit 7, can be written in terms of indefinite integrals. 
These forms of the rules are stated in the following box, in integral 
notation. 


Constant multiple rule and sum rule for indefinite integrals 


frre) de = kf fe) dz, where k is a constant 


[E@)+9@)ae = f pe)ae+ f (0) ax 
As with the other constant multiple rules and sum rules, you can choose 
whether or not to use these rules in your working, as seems convenient. 


The next example repeats Example 10(b), but uses the constant multiple 
rule and sum rule for indefinite integrals in the forms stated above. 
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Here’s something to notice about Example 11. You might have thought 
that each of the two indefinite integrals in the second line of the working 
should have been replaced by an expression containing an arbitrary 
constant. That would lead to a solution something like this: 


eee a) 


= (x + ¢) — 5(ln |z| + d) 
=x — 5ln |z| + c — 5d, 
where c and d are arbitrary constants. 


However, since the arbitrary constants c and d can take any value, the 
expression c — 5d can also take any value, so we may as well replace it by a 
single letter that can take any value. It’s convenient to use the usual 
letter, c. In general, if an expression consists of a sum of constant multiples 
of indefinite integrals, then it needs only a single arbitrary constant. 


You can practise using the constant multiple rule and sum rule for 
indefinite integrals in the next activity. 


3 Integration by substitution 


Activity 20 Using the constant multiple rule and sum rule for 
indefinite integrals 


Find the following indefinite integrals. 


(a) f (; : ) ae (b) / E E de 


r 1+2 


You’ve now reached the end of the material in this module that tells you 
what integration is all about. In the rest of this unit you’ll concentrate on 
actually finding indefinite and definite integrals. 


Before you go on to that, you might like to watch the 25-minute video = 
The birth of calculus, which is available on the module website. It tells you Cc) 
more about the history of the development of calculus. 


3 Integration by substitution 


To allow you to fully exploit all the ideas about integration that you’ve 
met, it’s important that you’re able to find formulas for the indefinite 
integrals of as wide a range of functions as possible. In the rest of this 
unit, you'll learn to integrate a much wider variety of functions than you 
learned to integrate in Unit 7. In the final subsection of the unit, you'll 
learn how to use the module computer algebra system to find indefinite 
and definite integrals. 


Unfortunately, as mentioned in Unit 7, it’s generally more tricky to 
integrate functions than it is to differentiate them. That’s because there 
are no rules for antiderivatives that are similar to the product, quotient 
and chain rules for derivatives, so you can’t usually integrate functions in 
the systematic way that you can differentiate them. 


However, there are still many useful techniques that you can use to 
integrate functions. Each of these techniques applies to functions with 
particular characteristics. You’ll meet some of these techniques in the rest 
of this unit. You'll learn how to recognise functions to which each 
technique can be applied, and how to use the techniques. 


The technique that you'll learn in this section is integration by substitution. 
This technique might seem quite complicated when you first meet it, but 
you should find it more straightforward once you’ve had some practice 
with it. If you find it difficult, then make sure that you watch the tutorial 
clips, and try some extra practice. Remember that there are more practice 
exercises in the practice quiz and exercise booklet for this unit. 
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3.1 Basic integration by substitution 


Integration by substitution is based on reversing the chain rule for 
differentiation. 


Consider what happens when you differentiate an expression by using the 
chain rule. Remember that you start by recognising that the expression is 
a function of ‘something’. You differentiate the expression with respect to 
the ‘something’, then you multiply the result by the derivative of the 
‘something’ with respect to the input variable (usually x). For example, by 
the chain rule, 


d ;. 
in (sin(x?)) = (cos(x?)) (2x). 


Since differentiation is the reverse of integration, this equation tells you 
that 


f (cos(x°)) (2x) dx = sin(x?) + c. 
Whenever you differentiate an expression by using the chain rule, you 
always obtain an expression of the form 

f(something) x the derivative of the something, 


where f is a function that you can integrate. The essential idea of 
integration by substitution is to recognise an expression that you want to 
integrate as having this form, and then perform the integration by 
reversing the chain rule. 


To help you reverse the chain rule, it’s helpful to denote the ‘something’ by 
an extra variable, in the way that you did when you first learned to use the 
chain rule in Unit 7. The usual choice of letter for the extra variable is u. 


Here’s how you can use this method to find the integral 


fose) (2x) da, 
if you hadn’t first seen the integrand obtained as an ‘output’ of the chain 
rule. 
The first step is to recognise that the integrand has the form 

f (something) x the derivative of the something, 
where f is a function that you can integrate. You can see that it does, 
since it is 

cos(something) x the derivative of the something, 

2 


where the ‘something’ is x^. 


Then, since the ‘something’ is x2, you put u = z2. This gives du/daz = 22. 
Hence, in the integral you can replace cos(x?) by cosu, and 2x by du/dz, 
to give 


fost) = dz. 


Now here’s the crucial step that you need to apply at this stage. Any 
integral of the form 


fro Tds 


T£ 


is equal to the simpler integral 


I Pii 


This is the step that uses the chain rule, and you'll see an explanation of 
why it’s correct later in this subsection. Notice that it’s easy to remember, 
because it looks like we’ve simply cancelled a ‘dx’ in a denominator with a 
‘dx’ in a numerator. (Of course that’s not what we’ve done, since du/dx 
isn’t a fraction.) 


So the integral that we’re trying to find here can be expressed in terms 
of u as the simpler integral 


f cos u du, 


where u = x”. You can now do the integration, which gives 
sin u + c. 


The final step is to express this answer in terms of x, using the fact that 
u =x”. This gives the final answer 


sin(x?) + c. 


So we’ve now worked out that 
f (cos(x°)) (2x) dx = sin(x”) + c, 
as expected. 


Here’s a summary of the method used above, which is the method known 
as integration by substitution. 


Integration by substitution 
1. Recognise that the integrand is of the form 
f(something) x the derivative of the something, 


where f is a function that you can integrate. 
2. Set the something equal to u, and find du/dz. 


3. Hence write the integral in the form 
TEOLT 
d 
by using the fact that [tw = dr = [tw du. 


4. Do the integration. 


Substitute back for u in terms of x. 
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As illustrated by the example above, the effect of substituting the 
variable u in place of the ‘something’ in the integral is to reduce the 
integral to a simpler integral in terms of u, which you may be able to find 
more easily. 


You'll see another example of integration by substitution shortly, and 
you'll then have a chance to try it for yourself, but first it’s useful for you 
to learn about a convenient way to carry out step 3 of the method. 


Consider once more the integral above: 
J (cos(a?)) (2x) dx. 


Remember that we put u = 2”, which gave du/dx = 2x, and this allowed 
us to write the integral in terms of u as 


f cos u du. 


It’s convenient to think of this new form of the integral as being obtained 
from the original form by making two replacements, as shown below: 


feed) (2x) dz. 
— a ~ 


cosu du 


It’s straightforward to work out that you can replace cos(x?) by cosu, as 


that just comes from the equation u = x. 


A helpful way to work out that you can replace (2x) dx by du is to imagine 
‘cross-multiplying’ in the equation 


du 


eure 
dz T, 

to obtain 
du = (2x) dz. 


This isn’t a real equation, of course, since du and dx don’t have 
independent meanings outside the notation for a derivative or integral. 
But it tells you immediately that you can replace (2x) dx by du. (In fact, 
it’s possible to attach a meaning to du and dz, and hence to the equation 
above, but this is outside the scope of this module.) 


Here’s another example of integration by substitution, which illustrates all 
the ideas above. 


Example 12 /ntegrating by substitution 
Find the integral 


/ el sec? x da. 


Here are some examples for you to try. Make sure that you have your 
Handbook to hand, so you can consult the tables of standard derivatives 
and standard integrals. 


Activity 21 /ntegrating by substitution 


Find the following indefinite integrals. 


(a) / e82(_sinz)dx W J ae (322) de 


(c) / (=) ones a) J (==) [amdi 


1 
(e) Js zcoszar (£) I) cos x da 


In the examples that you’ve seen so far, it’s been fairly straightforward to 
decide what the ‘something’ should be, so that the integrand is of the form 


f (something) x the derivative of the something. 


However, sometimes you need to think about this more carefully, as 
illustrated in the next example. 
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Example 13 Choosing the ‘something’ when you integrate by 
substitution 
Find the integral 


re) 
—— |] coszdz. 
2 +sing 


Solution 


@. The integrand is of the form 
il 
——————— x the derivative of the something, 
2 + something 


where the something is sin x. 
It’s also of the form 


—— x the derivative of the something, 
something 


where the something is 2 + sin z. 

Choosing the first option would lead to the first part of the integrand 
being replaced by 1/(2 + u), which isn’t straightforward to integrate. 
Choosing the second option would lead to the first part of the 


integrand being replaced by 1/u, which you can integrate using a 
result from the table of standard integrals. 


So choose the second option: take the something to be 2 + sin z, and 


set it equal to u. & 


d 
Let u = 2 + sin z; then “M= COS T. 
dx 


@. Imagine ‘cross-multiplying’ to obtain du = cos z dz. ® 


So 


1 
H=) cosrdr= | Žau 
2+sinx UW 


= In|ul +c 
=In|2+sinz|+c 


@. In the particular case here you can remove the modulus signs, 
since 2 + sin z is always positive. ® 


= ln(2 + sinz) + c. 


In each part of the next activity it’s useful to take the ‘something’ to be an 
expression that includes a constant term, in a similar way to Example 13. 


Activity 22 Choosing the ‘something’ when you integrate by 
substitution 


Find the following indefinite integrals. 


(a) f+ 00s2)"(—sina) de (b) [VIFF ex) ae 


(c) I (a5 —8)!(524)dz (d) 1 (==) dy 


(e) / (sin(5 + 2x°)) (6x?) dz 


In the next activity, some of the integrals are similar to those in 

Activity 22, whereas others are similar to those in Activity 21. You have 
to decide in each case what the ‘something’ should be. Remember that the 
idea is that when you replace the something by u in the first part of the 
integrand, you obtain an expression in u that you can integrate. 


Activity 23 Choosing the ‘something’ when you integrate by 
substitution 


Find the following indefinite integrals. 


(a) f behed W) f el+8in2 (cog 2) de 


(c) f (=) (1029)de (d) I a ae 


Finally in this subsection, as promised near the start, here’s an 
explanation of why 


fro Eas f rau, 


where the variable u is a function of x, and f is a function that you can 
integrate. To justify this, we let F be an antiderivative of f, and apply the 
chain rule to F(u). This gives 


d du 


dr (F(u)) = f(u) Ti 


It follows from this equation that 


[to fe - (u) +c. 


However, because F is an antiderivative of f, we also have 


[tw dna POD. 


Comparing the final two equations above proves the result. 
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3.2 Integration by substitution in practice 


Usually when you have an integrand that you can integrate by 
substitution, it won’t initially be exactly in the form 


f (something) x the derivative of the something. 


You have to start by recognising that it can be rearranged into this form, 
then rearrange it accordingly. Here are two examples that illustrate some 
things that you can do. 


Example 14 Rearranging an integral before using substitution 


Find the integral 


fe cos(e”) da. 


Solution 


@. The derivative of e” is e”, so the integrand is in the required form, 
except that the two factors cos(e”) and e” are in the ‘wrong’ order. If 
you swap them, then the integrand will be exactly in the required 
form. @& 


d 
Let u = e”; then e 
dx 


@. Imagine ‘cross-multiplying’ to obtain du = e” dx. Then start by 
swapping the two factors in the integrand. © 


So 
fe cos(e dr = J (coste") e” dx 


= [cosuau 


=sin(u) +¢ 
= sin(e”) + c. 
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Example 15 Rearranging another integral before using substitution CS) 
Find the integral = 


je (1 + ay dz. 
Solution 


@. The derivative of 1 + x? is 3x”, which is ‘nearly’ x? (it just has the 
wrong coefficient). So if you swap the order of the two factors x? and 
(1+ z3), then the integrand will be nearly in the required form. ©@ 


d 
Let u = 1+ 2°; then ee 
dx 


@. Imagine ‘cross-multiplying’ to obtain du = 32? dz. Then start by 
swapping the two factors in the integrand. ® 


So 
je (Cee dr = fat aar 


®. The only problem is that in place of z? you need 327. To achieve 
this, multiply by 3 inside the integral, and divide by 3 outside, to 
compensate. (The constant multiple rule for indefinite integrals tells 
you that it’s okay to do this.) ® 


= J (1 aa) (3x7) dx 


@®. Now do the substitution. ® 


=} fubau 


Se lg 
= 3X gu are 


Here’s something to notice about Example 15. You might have thought 
that when the integral f u® du in the fourth-last line of the solution was 
replaced by gu +c, the solution should have proceeded like this: 


=} fau 


(gu? +c) 
u? + $C 


(1+ 2%)? + ke. 


2 


| 
y- Nj- “= 
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However, since the arbitrary constant c can take any value, the expression 
$C can also take any value, so we may as well replace it by a single letter 
that can take any value. It’s convenient to use the usual letter, c. 


This sort of thing occurs frequently when you integrate. You never need to 
multiply or divide a term that is an arbitrary constant by a number — you 
just replace it by another arbitrary constant, and you usually use the 
standard letter, c. 


Another point to remember from the working in Example 15 is the 
strategy of multiplying by a constant inside an integral, and dividing by 
the same constant outside to compensate. It’s often helpful to do this 
when you use integration by substitution. It’s okay to do it because the 
constant multiple rule for indefinite integrals tells you that for any 
function f that has an antiderivative, and any non-zero constant k, 


fro d= | Erfe) i= = fet) m 


Remember that the constant k can be either positive or negative. In 
particular, it can be —1. 


Remember too that k must be a constant. You can’t do the same thing 
with an expression in 2. 


Here are some examples for you to try. 


Activity 24 Rearranging integrals before using substitution 


Find the following integrals. 


(a) J a” cos(a?) da (b) ev 1+2?dzx (c) T cost x sin z dx 


The next example illustrates another way in which you can sometimes 
rearrange an integrand to get it into the right form for integration by 
substitution. 


© 
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Solution 


@. The derivative of the denominator 1 — x? is —2x, which is ‘nearly’ 
the numerator x (it just has the wrong coefficient). So if you split the 
numerator from the rest of the fraction, then the integrand will be 
nearly in the required form. ® 
Let y= l= g then a = —?2r. 

da 


@. Imagine ‘cross-multiplying’ to obtain du = —2x dx. Then start by 
splitting the numerator from the rest of the fraction in the 
integrand. ® 


So 


ae il 
[t] (a) oe 


@. The only problem is that in place of x you need —2z. To achieve 
this, multiply by —2 inside the integral, and divide by —2 outside to 


compensate. ® 
1 
ee 
= -4f (=) (—2r) dz 


@®. Now do the substitution. ® 


1 


=-}lnlul +c 


=-—$ln|l—27|+<. 


The type of integral in Example 16 occurs quite frequently, so it’s worth 
learning to recognise it. You can use integration by substitution to find 
any integral of the form 
‘i the derivative of the something d 
x 
something 


or 
f the derivative of the something 
(something)” 


dz, where n is a constant. 


The constant n can be any nonzero real number. Here are some integrals 
of this type for you to find. 
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Activity 25 Rearranging more integrals before using substitution 


Find the following integrals. 


is) [r o foo (c) |e 


Here are three more integrals for you to find by using substitution. 
They’re a mixture of the types in Activities 24 and 25. 


Activity 20 Rearranging even more integrals before using 
substitution 


Find the following integrals. 


(a) fe singar (b) |; (c) f O+ isina) cos x dx 


— et 


You can use integration by substitution to find the indefinite integral of 
the tangent function. You’re asked to do that in the next activity. 
Activity 27 Finding the indefinite integral of tan 


By writing 


sin £ 


tanz = ; 
COS £ 


and using integration by substitution, find 


f tan z dz. 


Remember that when you’ve found an indefinite integral, you can always 
check your answer by differentiating it. This can be particularly helpful 
when you found the integral by using integration by substitution. 


3.3 Integrating functions of linear expressions 
This subsection is about a particular type of integral that occurs 


frequently, and which you can find by using integration by substitution. 


The integrals 


1 
fsz- dz, /o« and ie da 


all have integrands of the form f(ax + b), where f is a function, and 
a and b are constants with a Æ 0. In other words, each of the integrands is 
of the form f(a linear expression in zx). 


You can always find an integral of this form by using integration by 
substitution, provided that you can integrate the function f. Here’s an 
example. 
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Here are some similar integrals for you to find. 


Activity 28 /ntegrating functions of linear expressions 


Find the following integrals. 


(a) 1 Si — (b) fi wowe w f T 
(a) 1 stis tð f e f a 
(2) i aot (h) f ryt (i) / mo 


The next example uses exactly the same ideas as Example 17 and 
Activity 28, but is a little more complicated. 


Example 18 /ntegrating another function of a linear expression 
Find the integral 


1 
T 
ore i 


Solution 


1 
@. You can write the integrand as ————.,, which is of the 
1+ (32)? 


form f(3x) where f(x) = . The integral of this function f is a 


1+ 2? 
standard integral. Since the derivative of 3x is 3, you need to 
rearrange the given integral to obtain 3 inside. ® 


Let u = 3a; then ae =o) S0 
dx 


1 i 
Naa de 
[rts 1+ (87)? 
= | er z ock 
=: | 
3 Toe 


= tan! U+E 
= 4 tan !(3z) + c. 
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Here are some similar integrals for you to find. 


Activity 29 Integrating more functions of linear expressions 


Find the following integrals. 


(a) lee (b) |r (c) [rr 


Hint for part (c): start by writing 
1 il . 1 
— as -—x —,—. 
44002 “ 4° 1439x2 


The answers to Activities 28 and 29 all follow the same general pattern, as 
follows. Suppose that you have an integral of the form 


[tox + b) da, 


where a and b are constants with a 4 0, and you know an antiderivative F 
of the function f. To find the integral by substitution, you put u = ax + b, 


which gives i: =a. Then 


| tar +d)ar = ~ | flax +b)ade 

=< f rau 
eee 

= “Flax +b) +6. 


So we have the following general fact. 


Indefinite integral of a function of a linear expression 


If f is a function with antiderivative F', and a and b are constants 
with a Æ 0, then 


J flax+d)dx = 2Flar +8) +e. 


It’s worth remembering this fact, and using it to deal with integrals where 
the integrand is a simple function of a linear expression, rather than using 
integration by substitution every time. For example, the fact above tells 
you immediately that 


[ costae — 2) dx = $ sin(3x — 2) + c. 
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The following special case of the fact in the box above is useful especially 
often. 


Indefinite integral of a function of a multiple of the variable 


If f is a function with antiderivative F’, and a is a non-zero constant, 
then 


J fax) di = T F(aa) FE 


For example, 


m dz = —je >" +c. 


Try using the facts in the boxes above in the following activity. 


Activity 30 = /ntegrating functions of linear expressions efficiently 


Find the following integrals. 


a f elias @ / hesi I e3t+4 dt 
(a) | wto (¢) I edp (f) / —— d 
w f ar (h) f srr- iis A / sin(x) da 
G) i Stw (k) / ede (I) i araid 


The next example is of a type that you met in Unit 7, but it involves a 
function that you’ve learned how to integrate in this section. 
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Example 19 Integrating in a practical situation A 


The velocity of an object moving along a straight line is given by the 
equation 

v= 4sin (št — 2), 
where t is the time in seconds since the object began moving, and v is 
the velocity of the object, in ms~!. The displacement of the object at 
time ¢ = 0 is 3m. 


Find an equation for the displacement s (in metres) of the object, in 
terms of t. Give the constant term to two decimal places. 


Solution 
@. Use the fact that displacement is an antiderivative of velocity. ® 


The velocity v (in ms™!) of the object at time t (in seconds) is given 
by 


v = $sin (št — 2). 


Hence the displacement s (in m) of the object at time t (in seconds) is 


given by 
s= f sin (ĝt 2) dt 
1 1 
m yp ee 


= —4 cos (3t — 2) +c, 

where c is an arbitrary constant. 
@. Use information given in the question to find the value of c. ® 
When t = 0, s = 3. Hence 

3 = —4 cos (3 x0- 2) +c 

3 = —4 cos(—2) + c 

c=3+ $ cos(—2) 

C= $ cos 2 

e= 280 (W 2 dp) 
So the equation for the displacement of the object in terms of time is 


g= 4 cos (3t — 2) + 2.86. 
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Activity 31 /ntegrating in a practical example 
The velocity of an object moving along a straight line is given by the 
equation 
v = 2 cos (5¢+5) F 
where t is the time in seconds since the object began moving, and v is the 


velocity of the object, in ms~!. The displacement of the object at time 
t=O is 4m. 


(a) Find an equation for the displacement s (in metres) of the object, in 
terms of t. Give the constant term to two decimal places. 


(b) Find the displacement of the object at time t = 10. Give your answer 
in metres to two significant figures. 


3.4 Integration by substitution for definite integrals 


Once you’ve used integration by substitution to find an indefinite integral, 
you can find any definite integral with the same integrand by using the 
fundamental theorem of calculus in the usual way. Here’s an example. 


Figure 41 The definite 
integral in Example 20 
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@. Apply the fundamental theorem of calculus. ® 


Hence 
37/2 
i i 37/2 
i ee * cos nda = e] r 
T2 
= esin(3x/2) a esin(n/2) 
Ze! 
il 
=-—e 
e 


—2.35 (to 3 s.f.). 


An alternative way to use integration by substitution to evaluate a definite 
integral is to work with the definite integral from the start, rather than 
first finding an indefinite integral. If you do this, then because the limits of 
integration are values of x, when you change the variable from x to u you 
must also change the limits of integration to the corresponding values of u. 


This method is demonstrated in the next example. The definite integral in 
this example is the same as the one in Example 20. 


g — ) 
Example 21 Evaluating a definite integral by substitution (second CS) 
method) ae 


Evaluate the definite integral 


3n/2 ; 
i Ecos ride. 
T2 

Give your answer to three significant figures. 


Solution 
@. Use integration by substitution. ©& 


; du 
Let u = sin z, then — = cos zx. 


dx 


@. Find the values of u that correspond to the values of x that are 
the limits of integration. ® 


T 
Putting x = 7/2 into u = Sima gives u = sin (5) =i, 


3 
Putting w = 37/2 into u = sing gives u = sin (=) =k 
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You can try using this method in the next activity. 


Activity 32 Evaluating definite integrals by substitution 


Evaluate the following definite integrals. Give your answers to three 
significant figures. 


1 x T 0 3 
(a) f I dz (b) L, cos? z sin x dx (c) D re`?” dr 


3.5 Finding more complicated integrals by 
substitution 


You can sometimes use integration by substitution to find an integral even 
if the integrand isn’t obviously of the form 


f(something) x the derivative of the something. 


In this subsection, you’ll see some examples where the integrand is nearly 
of this form, except that it also contains a further factor, which is an 
expression in x. If you can express this further factor in terms of the 
variable u that you use for the substitution, then you may be able to 
simplify the integrand into a form that you can integrate. Here’s an 
example. 


Example 22 Expressing a further factor in terms of u when 
integrating by substitution 


Find the integral 


ie o ei)? as, 


Solution 


®, If the integral were just 


fe 41) de, 


then we could use the substitution u = 3x + 1, and find the integral 
by first writing it as 


1 fee? x 3dz. 


However, the integrand contains a further factor, x + 1. Try using the 
substitution, expressing the further factor x + 1 in terms of u. ® 


d 
Let u = 3x + 1. Then a 
dx 
Also, rearranging the equation u = 3x + 1 gives 


u = 3x +1 
3r=u- 1 


So 
[e+nect Idr = i [@+@r+ 1)? x 3dr 
= 4 f Hu 2w au 
= y f (w+ 2)u° du 
/ (9 + 2u®) du 
(mu +2 x pu”) FCE 


E on Wile 
=p tat +e 


= (3r + 1)" + Brt 1) +e. 


E ol ole 


3 


T 
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Here are two similar integrals for you to try. 


Activity 33 Expressing further factors in terms of u 


Use integration by substitution to find the following indefinite integrals. 


(a) [eviveae (b) [Ee 


Yov’ll learn more about integration by substitution in the module MST125 
Essential mathematics 2, if it’s in your study programme. 


4 Integration by parts 


In this section you’ll meet another integration technique, integration by 
parts. Like integration by substitution, this technique is useful for 
integrating functions that have certain characteristics. It’s based on the 
product rule for differentiation. 


4.1 Basic integration by parts 


Suppose that f and g are functions, and that the function G is an 
antiderivative of the function g. Consider the equation that you obtain 
when you use the product rule to differentiate the product f(x)G(zx): 


L (F(2)G(a)) = fe) (= G(e)) +G(2) (= f(z) : 


that is, 

d 

aa CCE) = F(a) g(a) + Ge) f'(x). 
If you swap the order of G(x) and f'(x) in the final term, and subtract this 
term from both sides of the equation, then you obtain 


L (F()G(a)) - f(@)G(x) = Flea). 


Swapping the sides of this equation gives 


f(x)g(x) = E (f(2)G(x)) — f'(@)G(a). 


4 Integration by parts 


If you now integrate both sides of this equation with respect to x, then you 
obtain 


f Tenai- | f'()G(2) dz. 


There’s no need to add an explicit arbitrary constant here, because it’s 
included in the integral f f(x)g(x) dz on the left-hand side, and in the 
integral f f’(x)G(x) dx on the right-hand side. 


The formula above is known as the integration by parts formula. It’s 
stated again in the box below, for easy reference. If you’ve met it before, 
then you may have seen it stated in a different form. This is explained at 
the end of this section. 


Integration by parts formula (Lagrange notation) 


| f(x)g(x) dx = f(x)G(x) — J f'(x)G(a) de. 


Here G is an antiderivative of g. 


The integration by parts formula is useful for integrating some products of 
the form f(x)g(x). You can see that it changes the problem of integrating 
f(x)g(x) into the problem of integrating f'(x)G(x), where G is an 
antiderivative of g. 


So it’s useful for integrating products of the form f(x)g(x) that have the 


following characteristics. 


e You can find an antiderivative G of g; in other words, you can 
integrate the expression g(x). 


e The product f'(x)G(x) is easier to integrate than the product 
f(x)g(x). For example, this may be the case if f'(x) is simpler than 
f(z). 

When we integrate by using the integration by parts formula, we say that 

we're integrating by parts. Here’s an example. 
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Cc) | Example 23 Integrating by parts 
Find the integral Hf ge Shia 40 Cle. 


Solution 


@. The integrand is a product of two expressions, x and sing. You 
can integrate the second expression, sin x, and differentiating the first 
expression, x, makes it simpler. So try integration by parts. ® 


Let f(x) = end g(@) = e a 
Then f’(x) = 1, and an antiderivative of g(x) is G(x) = — cos x. 


So the integration by parts formula gives 
[esinede = f toso) da 
= H(e)G(o)- | f"@)G@) ae 


= £ X (—cosz) = fi x (— cos x) dx 


—x cos T + [cosa 
@. Find the integral in this expression. ©& 


= —g cos x + sin g + c. 


You can see that in Example 23 the use of the integration by parts formula 
changed the problem of integrating x sin x into the problem of integrating 
cos x, which is easier. Here are two similar examples for you to try. 


Activity 34 Integrating by parts 


Find the following integrals. 


(a) f ceeds W f prd 


As you become more familiar with integration by parts, you’ll probably 
find that you can carry it out more quickly and conveniently if you 
remember the informal version of the formula stated below. You can recite 
this version in your head as you apply integration by parts, in a similar 
way to the informal versions of the product and quotient rules for 
differentiation that you met in Unit 7. It’s useful for applying integration 


178 


4 


by parts in fairly simple cases, like the ones that you’ve seen so far, which 
is usually all that you need to do. 


Integration by parts formula (informal) 


of second 


ee lof derivative : antiderivative 
Ca PEES of first of second i 


integral of product = (first) x ee) 


In the next example, the integration in Example 23 is repeated, using the 
informal version of the integration by parts formula. 


You can practise using the informal version of the integration by parts 
formula in the next activity. Alternatively, you might prefer to continue 
using the formal version for a little longer, and move to the informal 
version when you feel more confident with it. You can use the exercises on 
integration by parts in the practice quiz and exercise booklet for this unit 
for further practice. 


Integration by parts 
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Activity 35 = /ntegrating by parts 


Find the following integrals. 


(a) J Code W / weds A / needa 


When you use integration by parts, remember that it matters which 
expression in the integrand you take to be the ‘first expression’, and which 
you take to be the ‘second expression’. If you’re trying to use integration 
by parts, but you find that it leads to an integral that’s more complicated 
than the original integral, then try swapping the two expressions in the 
original integrand. 


4.2 Integration by parts in practice 


There are several frequently occurring types of integrals that often have 
the right characteristics for integration by parts. In this subsection you'll 
meet three of these types. It’s helpful for you to learn to recognise them. 


Integrands of the form x” g(x) 
You can sometimes find an integral of the form 
J x” g(x) dx, 


where n is a positive integer and g(x) is an expression that you can 
integrate, by using integration by parts n times. 


All the integrals in the previous subsection were of this form, with n = 1, 
so you could find them by using integration by parts once. Here’s an 
example where you have to use integration by parts twice. 


Integrating by parts gives 
ee dz = x? x ze" — [ep x ze" dz 


= me” — yf xe dz 


@. The integral in this expression, f xe*” dz, looks easier to find than 
the original integral, but it’s still not straightforward to find. 
However, it is itself of the form described above, so try integration by 
parts again. ® 


@. Multiply out the brackets. Take care with the minus sign in front 
of the brackets — remember that it turns the minus sign inside the 
brackets into a plus sign. ® 


Si Pt eg ale ep all 4x 
= qre gre +t fe dx 


@. Now find the integral in this expression, and simplify the resulting 
expression. ® 


De ie. ee 
8 Saeed 
2 4 il Ag IAT 
te = Re Tae rE 


= qe" (8x? — 4r +1) +c. 


Activity 30 Integrating by parts twice 


Find the following integrals. 


(a) f rrd W f Piod 


The method introduced above for dealing with integrands of the form 
x”g(x) works just as well if the expression x” is replaced by any 
polynomial expression in x of degree n. For example, you can find the 
integral f (3x? — 1)e** dz by integrating by parts twice, and you can find 
the integral in the next activity by integrating by parts once. 
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Activity 37 Integrating an expression of the form (ax + b)g() 


Use integration by parts to find the integral I (5a + 2)e™™ dx. 


Integrands of the form x” lnz 
Suppose that you want to integrate an expression of the form 
ae” Ine, 


where r is a constant (any real number). Your first thought, particularly if 
r is a positive integer, might be that this is similar to the expressions that 
you saw how to integrate earlier in this subsection. However, you can’t 
apply the integration by parts formula with ln as the second expression, 
because In z isn’t an expression that you’ve seen how to integrate. 


In fact, you can find an integral of the form above by applying the 
integration by parts formula with ln x as the first expression. Here’s an 
example. 


Activity 38 Integrating expressions of the form x" ln x 


Find the following integrals. 


(a) fma (b) [mear 


Hint for part (b): write Inz as (ln x) x 1. 


You can use the ideas that you used in Activity 38 to deal with other, 
similar integrals that involve the natural logarithm function. Try the 
following activity. 


Activity 39 /ntegrating another expression containing the function In 


Use integration by parts to find the integral I (x — 2) Iln(3x) dz. 


Integrands of the form e°” sin(bx) or e°” cos(bx) 
You can integrate an expression of the form 
e% sin(bx) or e°% cos(bxr), 


where a and b are nonzero numbers, by integrating by parts twice. When 
you do this, you obtain an equation that expresses the original integral in 
terms of itself, and you can rearrange this equation to find the original 
integral. The method is demonstrated in the example below. 


When you use this method, you can arrange the exponential expression 
and the trigonometric expression in the integrand in either order before 
you integrate by parts for the first time. For example, you can start with 
either f e% sin(bx)dx or f sin(ba)e"* dz. However, you must choose the 
same order (exponential before trigonometric, or trigonometric before 
exponential) when you integrate by parts for the second time. If you don’t 
do that, then the equation that you’ll obtain will reduce to 0 = 0, which 
isn’t helpful! 
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Example 27 Finding an integral by expressing it in terms of itself 
Find the indefinite integral l e” cos(2x) dx. 


Solution 


@. The integral is of the form discussed above, so use integration by 
parts twice. Start by swapping the expressions e” and cos(2z), as this 
makes the working slightly easier (it avoids introducing fractions in 
the early stages). & 


We use integration by parts twice. 
fe cos( 25) = J costae) e* dx. 
@. Integrate by parts. ® 
J costae) e” dx = cos(2x)e” — fe sin(2x))e* dx 
= cos(2x)e” + 2 | sin(2x)e” dx 
@. Integrate by parts again. Ad 
= cos(2x)e” + 2 (sint2ayer — fe cos(22) )e” ar) 
= cos(2z)e” + 2sin(2x)e” — 4 f cos(2a)e* dx 


@. The final term here contains the original integral. Move this term 
to the other side of the equation. When you do this, the right-hand 
side no longer contains an indefinite integral, so you have to add an 
arbitrary constant to that side. @ 


So 

Josen dz + 4 | cos(2z)e" dx = cos(2x)e” + 2sin(2x)e* + c; 
that is, 

5 | cos(2r)e" dx = cos(2x)e” + 2sin(2x)e” + c. 


®. To obtain the final answer, divide both sides by 5. (Remember 
that you leave the arbitrary constant as c rather than c/5.) Also, it’s 
slightly tidier to write the exponential expression before the 
trigonometric expression in each term, and to take out a common 
factor. ® 


This gives 
fe cos(2x) dx = $e" cos(2x) + $e” sin(2x) +c 


= $e"(cos(2x) + 2sin(2x)) + c. 


Activity 40 Finding integrals by expressing them in terms of 
themselves 


Find the following integrals. 


E / Pade © / Feni 


4.3 Integration by parts for definite integrals 


Once you’ve used integration by parts to find an indefinite integral, you 
can find any definite integral with the same integrand by using the 
fundamental theorem of calculus in the usual way. Here’s an example. 


Example 28 Evaluating a definite integral by parts (first method) 
Evaluate the definite integral 


1 
| re” dz. 
0 


Solution 


@. Use integration by parts to obtain the indefinite integral of the 
integrand. ® 


Integration by parts gives 


o j- fix ze" dz 


S me ol (ee 
= zte sje dx 
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An alternative way to use integration by parts to evaluate a definite 
integral is to apply the limits of integration ‘as you go along’. To do this, 
you use the following formula, which is just the usual integration by parts 
formula with the limits of integration included. 


Integration by parts formula for definite integrals 


b 4 b 
| eyed = FOCH] -= | F (x)G(a) da. 


Here G is an antiderivative of g. 


You can remember this formula as an informal version, by applying the 
limits of integration a and b in the appropriate way to the informal version 
of the usual integration by parts formula. 


The next example demonstrates how to apply the limits of integration as 
you go along when you integrate by parts. The definite integral in this 
example is the same as the one in Example 28. 


Activity 41 Evaluating a definite integral by parts 


Use the integration by parts formula for definite integrals (the method of 
Example 29) to evaluate the definite integral 


1 
| re?” dz. 
0 


Give your answer to four decimal places. 


In the next two activities you’re asked to use integration by parts to find 
areas. 


Activity 42 Using integration by parts to find an area 


This question is about the function f(x) = xsin(3z). 


(a) Explain why the graph of f lies above the x-axis for values of x in the 
interval [1/12, 7/6]. 

(b) Write down a definite integral that gives the area between the graph 
of f and the z-axis from x = 7/12 tox = 7/6. 


(c) Use the integration by parts formula for definite integrals to find the 
area described in part (b). Give your answer to four decimal places. 
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Activity 43 Using integration by parts to find another area 


This question is about the function f(x) = (x — 2)(e~* — 1). 


(a) Explain why the graph of f lies above the x-axis for 0 < x < 2 and 
below the x-axis for x > 2. You might find it convenient to use a table 
of signs as part of your explanation. 


(b) Find the total area (not the total signed area) between the graph of f 
and the x-axis, from x = 0 to x = 4. Give your answer to four 
significant figures. 


As mentioned earlier, in some texts the integration by parts formula is 
expressed in a different form. 


Here’s the formula that you’ve seen in this section: 


J festa) ax = F@)G(@) - f Fele) ae. 
It involves a function G and its derivative g. 


If you denote this pair of functions by g and g’ instead, then you obtain 
the alternative version below. 


Integration by parts formula (Lagrange notation, alternative 
version) 


J fod @)ae = fogla) - | Feed 


As with all calculus formulas, the integration by parts formula can be 
stated in Leibniz notation. 


If you put u = f(x) and v = g(x) in the formula in the box above, then 
you obtain the following form of the formula. 


Integration by parts formula (Leibniz notation) 


dv du 
feas w- fva 
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5 More integration 


You’ve now met several integration techniques, and practised using them 
on a variety of integrals. 


In this section, you’ll start by meeting one further approach, which can be 
helpful for integrals that contain trigonometric expressions. Then you’ll 
have an opportunity to practise identifying which of the integration 
techniques that you’ve met are appropriate for which integrals, for a 
variety of different types of integrals. 


Finally, you'll learn how to use a computer for integration. 


5.1 Trigonometric integrals 


In Section 3 you saw that you can integrate some expressions that contain 
trigonometric functions by using integration by substitution. Another 
approach that’s often helpful when you want to integrate an expression 
that contains trigonometric functions is to start by using one of the 
trigonometric identities that you met in Unit 4 to write the expression in a 
different form. This can give you an expression that’s easier to integrate. 
For example, you can use the identities below to help you integrate the 
expressions sin? x, cos? x and sin x cos x, as illustrated in the next example. 
The first two identities are the half-angle identities, and the third identity 
is one of the double-angle identities. 


Three trigonometric identities 
sin? 9 = 3(1 — cos(26)) 
cos? 6 = 4(1 + cos(26)) 
sin(20) = 2 sin 0 cos ð 
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In the next activity you’re asked to use two of the trigonometric identities 
stated near the beginning of this subsection to integrate another two 
expressions that contain trigonometric functions. 


Activity 44 Integrating by using trigonometric identities 


(a) Use a half-angle identity to find the integral / cos? 8 dé. 


(b) Use the double-angle identity in the box above to find the integral 


sin x cos x dz. 


The trigonometric integral in Example 30 and the one in Activity 44(a) 
can’t easily be found without first using a trigonometric identity to write 
them in a different form. 


However, there’s an alternative way to find the integral in Activity 44(b). 
Since the derivative of sin x is cos x, you can use integration by 
substitution, taking u = sin x. You’re asked to do that in the next activity. 


Activity 45 = /ntegrating a trigonometric expression by using 
substitution 


Use integration by substitution to find the integral f sin x cos g dz. 


In fact there are even two different ways to find the integral in 
Activity 44(b), namely 


[om xcosx dz, 


by using integration by substitution. One way is as suggested above: you 
use the fact that the derivative of sin x is cosa and hence make the 
substitution u = sin x. Alternatively, you can use the fact that the 
derivative of cos x is — sin xz, write the integral as 


- | (cose)(—sin.) ae 


and make the substitution u = cosx. The working for each of these 
methods is given in the solution to Activity 45. 


So you’ve now seen three different ways to integrate the expression 

sin xz cosg: one way by using a trigonometric identity, and two ways by 
using substitution. These three different ways lead to three answers that 
look different, as you can see in the solutions to Activities 44(b) and 45. 
This illustrates that there can be more than one way to integrate an 
expression, and different ways of integrating it can lead to answers that 
look different, though of course these answers will be equivalent. 


If an answer that you obtain for an integral is different from the answer 
given in a book or by a computer, but you think that you haven’t made a 
mistake, then you may be able to check that the two answers are equivalent 
by using algebraic manipulation. You may need to use trigonometric 
identities — there are examples of this in the solution to Activity 45. 


A rougher, but sometimes quicker, check is to plot the graphs of the two 
different answers on a computer (taking the arbitrary constants to be zero, 
for example), and check that they seem to be vertical translations of each 
other. This shows that the two different answers seem to be equivalent 
once the arbitrary constants are taken into account. Again, there’s an 
example of this in the solution to Activity 45. 


In this subsection you’ve seen just a few examples of how you can use 
trigonometric identities and integration by substitution to help you 
integrate expressions that contain trigonometric functions. There are many 
more possibilities, some of which are covered in the module Essential 
mathematics 2 (MST125). 
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5.2 Choosing a method of integration 


When you have an integral that you want to find, it may not be obvious 
which method to try. Also, sometimes you might need to combine two or 
more techniques — for example you might be able to use algebraic 
manipulation, the constant multiple rule and the sum rule to express an 
integral in terms of simpler integrals, then use integration by substitution 
or integration by parts to find the simpler integrals. 


Here are some things to think about when you have an integral that you 
want to find. 


Choosing a method for finding an integral 

e Is it a standard integral? Consult the table in the Handbook. 

e Can you rearrange the integral to express it as a sum of constant 
multiples of simpler integrals? 


e Is the integrand of the form f(ax) or f(ax + b), where a and b 
are constants and f is a function that you can integrate? If so, 
use the rule for integrating a function of a linear expression, 
or use integration by substitution, with u = ax or u = ag + b as 
appropriate. 


e Can the integrand be written in the form 
f (something) x the derivative of the something, 


where f is a function that you can integrate? If so, use 
integration by substitution. Start by setting the ‘something’ 
equal to u. 


e Is the integrand of the form 


f(x)g(@), 


where f is a function that becomes simpler when differentiated, 
and g is a function that you can integrate? (For example, f(z) 
might be x or z? or Inz.) If so, try integration by parts. 


e Ifthe integrand contains trigonometric functions, can you use 
trigonometric identities to rewrite it in a form that’s easier to 
integrate? 


When you're finding an indefinite integral, try to always remember to add 
the arbitrary constant ‘+c’. It’s easy to forget it! 


In the next activity you can practise identifying which method of 
integration to use. 


Activity 46 Choosing a method of integration 
For each integral below, suggest a method of integration to use. Where you 
suggest substitution, also suggest a suitable equation for u in terms of x. 


You are not asked to actually find the integrals, but solutions are included 
so you can do so if you wish. 


(a) [28 cos(9e") a (b) J cos(a) dz (c) J(e + cos52))ax 


1 x 
(a) fats) w i |rt (f) [wore 
(g) fee? dz (h) [om (52) cos (4x) da (i) fe — 1)f dz 
(j) ff e*(1—2e") dz (k) J jisne U T 2(302 — 2)8 der 


You'll learn further useful strategies for integration in the module Essential 
mathematics 2 (MST125), if it’s in your study programme. In particular, 
you'll learn more about integration by substitution and integration by 
using trigonometric identities, and you'll study partial fractions, which are 
useful when you want to integrate rational functions. 


Maybe we could make 
a substitution? 


Um wot sure we've 
got the right parts! 
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5.3 Integration using a computer 


There are many situations in which it’s helpful to use a computer for 
integration, such as when you want to integrate an expression that’s 
difficult to integrate by hand. In the first activity of this subsection you 
can learn how to use the module computer algebra system to find 
indefinite integrals and evaluate definite integrals. 


Activity 47 Using a computer for integration 


Work through Section 9 of the Computer algebra guide. 


You can use the skills that you learned in Activity 47 in the next activity. 


Activity 48 Using a computer to work with a complicated function 


This activity is about the function 
f(e) = x’ — 10x +15 
1+Vx41 ` 


Do parts (b) to (d) below by using the module computer algebra system. 
Give your answers to parts (c) and (d) to three significant figures. 


(a) What is the domain of f? 

(b) Plot the graph of f. 

(c) Find the two z-intercepts of f. 
( 


d) Find the area between the graph of f and the z-axis, between the two 
x-intercepts. 


The next two activities are the final ones of the unit. In them, you’re 
asked to use your knowledge of integration to solve practical problems. 
The definite integrals involved in the problems can’t easily be evaluated by 
using the fundamental theorem of calculus, as it’s difficult to find 
antiderivatives of Vz? + 1 and sin(t?/150). However, you can use a 
computer to find approximate values. 


5 More integration 


Activity 49 Using integration to solve a practical problem aca 


Consider the plastic edging shown below. The shape of its cross-section is 
given by the equation 


where x and y are measured in centimetres. 


Calculate the volume of plastic that is required to make one metre of the 
edging. Give your answer in cubic centimetres to three significant figures. 


Activity 50 Using integration to solve another practical problem NM] 


Suppose that an object moves in a straight line, and its velocity v 
(in ms~') at time t (in seconds) is given by the equation 


12 
= sin | —— 0<t< 20 
v =sin (5) (0 <t< 20), 
as shown in Figure 42. 


If the displacement of the object from a particular reference point on the 
line at time t = 0 is 6m, as shown below, what is the displacement of the 
object from this reference point at time t = 10? Give your answer to the 


> 


nearest centimetre. T T T T 
5 10 15 20 £ 
6m 
R e— o , Figure 42 The graph of 
v = sin(x? /150) (0 < t < 20) 
reference position position 
point at time at time 
t= 0 t= 10 
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You’ve now come to the end of the introduction to basic calculus in this 
module. You can use the skills that you’ve acquired to solve many different 
types of problems in mathematics, engineering and science, for example. 
These skills also equip you to study many further topics in these areas. 
You'll make a start on that in Unit 11, in which you'll use calculus in your 
study of Taylor polynomials. If your study programme includes further 
mathematics modules, then in some of these you may study more 
advanced calculus, with even more powerful applications. 


Learning outcomes 


After studying this unit, you should be able to: 
e understand what is meant by a definite integral 


e understand how to use subintervals to find an approximate value for a 
definite integral 


e use integral notation 
e understand the fundamental theorem of calculus 


e evaluate some definite integrals by using the fundamental theorem of 
calculus 


e integrate by substitution 
e integrate by parts 


e appreciate how to use trigonometric identities and integration by 
substitution to help you integrate some trigonometric expressions 


e identify a suitable method of integration for some types of integrals 


e use a computer for integration. 


Solutions to activities 


Solution to Activity 1 


If we divide the interval [—18, 18] into six 
subintervals of equal width, then each subinterval 
has width 36/6 = 6. 


The left endpoints of the six subintervals are 


—18, —18+6, —18+2x 6, 
—18+3x6, —18+4x6, —18+5x6 
that is 


18, —12, —6, 0, 6, 12. 
So the heights of the rectangles are: 
f(—18) = 9- $ x (—18) = 0 


f(-12) = 9- $ x (-12)? =5 
f(—6) = 9 — 35 x (-6)? = 
f0) =9- 4x0 =9 
f(6)=9- $ x 6° = 


f(12) =9- $ x12 =5. 
So we obtain the following approximate value for 
the area: 

(0 x 6) + (5 x 6) + (8 x 6) 

+ (9 x 6) + (8 x 6) + (5 x 6) = 210. 


Solution to Activity 2 
(a) The signed area from x = 1 to x = 5 is —6.3. 
(b) The signed area from x = 5 to x = 7 is 1.3. 


(c) The signed area from x = 1 to x = 7 is 
—6.3+1.3=—5. 


(d) The signed area from x = 1 to x = 9 is 
—6.3 + 1.3 — 5.9 = —10.9. 


Solution to Activity 3 
If we divide the interval [—3, 3] into six subintervals 
of equal width, then each subinterval has width 
6/6 =1. 
The left endpoints of the six subintervals are 

a ey ema 


Solutions to activities 


So the heights of the rectangles are: 
f(-3) =3- (-3)? = —6 


f(-2) =3—(-2)? = -1 
f(-1) =3-(-1)? =2 
f0) =3-0=3 
fj)=3-P= 
f(2)=3-2?=-1 


So we obtain the following approximate value for 
the signed area: 


(-6 x 1)+(-1 x 1)+ (2x 1) 
+(3x 1)+ (2x 1)+(-1x 1) =-1. 


Solution to Activity 4 


The exact value of the signed area between the 
graph of the function f(a) = 3 — x? and the x-axis 
from z = —3 to x = 3 is 0. 


Solution to Activity 5 


(a) The signed area from x = 2 to x = 6 is 4.0. 


(b) The signed area from xz = 2 to x = 6 is 4.0, 
so the signed area from x = 6 to x = 2 is —4.0. 


(c) The signed area from x = 2 to x = 10 is 
4.0 — 5.4 = —1.4. 


(d) From part (c), the signed area from x = 2 to 
x = 10 is —1.4, so the signed area from x = 10 
to x = 2 is 1.4. 


(e) The signed area from x = 8 to x = 8 is 0. 


(£) The signed area from z = —3 to v = 2 is 
4.1 — 2.4 = 1.7, so the signed area from x = 2 to 
z=-—3is —-1.7. 


) The signed area from x = —1 to x = 10 is 
—2.4+ 4.0 — 5.4 = —3.8, so the signed area 
from x = 10 to x = —1 is 3.8. 
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Solution to Activity 6 
=3 


_3 
(h) : f(z) dz = — f f(x) dx = —6 
2 7 
(i) i Heia- a a 
—5 -3 
(G) f(z)dz=-— | f(æ)dz=-(-5)=5 
-3 r 
a . 
(k) f(z)dx=—- | f(z)dz 
7 -3 
= =(=5) =9,;, 
by part (e) 
l d ‘ d 
(1) ; f(x) dx = — 2) z 
= —(—10) = 10, 


by part (f). 


Solution to Activity 7 


[toas f tea f sean 
SO 
45 = [ f(a) dx 


and hence 


[ s@a- 15 — (-11) = —4. 
7 
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Solution to Activity 8 
(a) [da?]; =$x5?-5 x3? 


(b) [cos a] 
(c) ela = e! — e71! = e — (1/e) = 2.35 (to 3 s.f.). 


= (25 —- 9) = 8. 


2m 
o = cos(2m) —cos0 = 1- 1=0. 


Solution to Activity 9 


(a) An antiderivative of x? is 4x, so 


(b) An antiderivative of ef is ef, so 


2 
f e dt = le] 
0 


— e? — el 
=e? — 1 = 6.39 (to 3 s.f.). 


(c) An antiderivative of 1/u (for u > 0) is Inu, so 


f tauni 
ET u = nuj 


=]n4—ln1 
= ln 4 = 1.39 (to 3 s.f.). 


(d) An antiderivative of sec? 6 is tan, so 


m/4 
f sec” 0 d0 = [tan o] me 


= m/A 
= tan 7 — tan (-7) 
=1-(-1) 


(e) An antiderivative of 1/(1 + u?) is tan! u, so 


1 
1 -1 11 
| wew- [tan ul 
= tan~'(1) — tan™t(—1) 
T T 
7-4) 
= /2 
= 1.57 (to 3 s.f.). 


Solutions to activities 


Solution to Activity 10 


4 4 -1/2 
(a) f 3vz dz =) 3a1/? da 
— [1,2 3 
1 1 l F = [žu 2 x žu ie 
= b x z” 1 
3/2 = |lu? — 243 
j i E 3% lap 
= [ae _ fie] B e] 
=) x 48/2 9 x 13/2 a Jiz L3- -1/2 
1 1 
_ ile 2 la 
aoe | aël a 
= 14. 
=} -(-b)-30-C 
(b) J ed+oex =10-4)-2044) 
l =}xi-3x? 
_ 2 
= ere ee =3-3 
=al 
= (ho? + 40"), j 
= G x 02 +4 x 0°) _ (4(-1)? + 4(—1)%) Solution to Activity 12 
=-(i—-}) v7 vi 
2 3 (a) f 52° dx = >f r? dz 
=-—ł4. v3 v3 
V7 
4 4 Dia 
2 T 2 r 
í =i 
= [r+5Inr], ~ 8 V3 
= (4+ 5ln4) — (2+ 51n 2) = + ((vV* - (v3)") 
= 2 + 5(In4 — ln 2) = 1(7? — 3?) 
= 2+ 5ln(4/2) (49 — 9) 
=2+4+5n2. a» 
s P T/4 
Solution to Activity 11 (b) i EEE 
5 =T 
(a) f x?/? de = E en n/A4 
2 / 3 = sincdx + | cos x dx 
_ Ea —n/4 —n/4 
5 T T 
r = [- cosa] a + [sin z] a 
— 5/2 
= 3 [e]; = — [cos z] "4 + [sin z] / 
—r/4 m/4 


2(5°/? _ 35/2) 


Sacks =- (cos (7) - cs (-3)) 


(8) 


l 
a 
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Solution to Activity 13 


The required area is given by 


“i. | 
a 
= 
Co 

w 


= (=18)") 


= 324 — 108 
= 216. 


So the area of the cross-section of the roof is 216 m?. 


Solution to Activity 14 
(a) The z- eee are given by 


a — 200 = 0 
a = 200 

x? = 6400 

a = +v6400 

x = +80. 


So the x-intercepts are —80 and 80. 
(b) The shaded area is 


Tis 0)° 2 200(80 = (—80)) 
= (2 x e — 200 x 160 


Doe, 


200 


Hence the shaded area is 21 333 cm?, to the 


nearest cm?. 


Solution to Activity 15 


(a) (i) From t=0 to t= 1 the displacement of 
the object changes by 2.3 cm. 


(ii) From t = 3 to t = 4 the displacement of 
the object changes by —1.8cm. That is, it 
changes by 1.8cm in the negative direction. 


(iii) From t = 0 to t = 3 the displacement of 

the object changes by 
(2.3 + 5.9 + 2.7) cm = 10.9 cm. 
(iv) From t = 3 to t = 6 the displacement of 
the object changes by 
(—1.8 — 2.0 — 0.3) cm = —4.1 cm. 
That is, it changes by 4.1cm in the 
negative direction. 
(b) (i) The displacement of the object from its 
starting position at time t = 2 is 
(2.3 + 5.9) cm = 8.2 cm. 

(ii) By (a) parts (iii) and (iv), the 
displacement of the object from its 
starting position at time t = 6 is 

(10.9 — 4.1) cm = 6.8 cm. 
(c) By parts (a)(iii) and (a)(iv), the object travels 
10.9cm in the positive direction, then 4.1 cm in 


the negative direction, so the total distance that 
it travels is 


(10.9 + 4.1) cm = 15 cm. 


Solution to Activity 16 


(a) The change in the displacement of the object 
from time t = 0 to time t = 1 is given by 


l 1 
l (3 — 3t)dt = “Be 3x w 


3t — val 
0 


Pimak 


3 
= 3(1 — 0) — ł(1? — 0°) 
J 

=$ 


= 2.25. 


That is, the object or 2.25m in that time 
(in the positive direction). 


(b) The change in the displacement of the object 
from time t = 4 to time t = 5 is given by 


E 5 
/ G- giat = fat- 3 x 30°] 


= —3.75. 


That is, the object travels 3.75m in that time 
(in the negative direction). 


Solution to Activity 17 


The change in the displacement of the car during 
those four seconds is given by 


i 742 E 
/ (28 — Tt) dt = [28t — Zt l 
4 4 
= 28t]; - 3t lo 
= 28(4 — 0) — £(4? — 0) 
= 112-56 
= 56. 


That is, the car travels 56m during those four 
seconds. 


(In fact, since the required answer is the area of the 
shaded triangle in Figure 39, you can calculate it in 
the following more straightforward way. 


The change in the displacement of the car during 
the four seconds is 


(5 x 4x 28)m = 56m.) 


Solution to Activity 18 


The change in the displacement of the car is the 
area shaded in Figure 40, which is given by 


5x5x15-5x4x 12 = $(75 — 48) 
= į x 27 = 13.5. 
So the car travels 13.5 m during the fifth second. 


(There are various ways to work out the area 
shaded in Figure 40. It is worked out above by 


subtracting the area of one triangle from the area of 


another, but alternatively you could add the areas 
of a rectangle and a triangle, or use the formula for 
the area of a trapezium.) 


Solutions to activities 


Solution to Activity 19 
(a) fea See 


(x? + « — 6) dz 


— 
€ 
S 
+ 
2 
— 
8 
| 

N 
<~ 
jor 
8 
I 


= $x? + $2? — 6r +c. 


& 
— 
BH 

a 

R 
I 
Cee: 
8 
eo 
a 
8 


1 
=f +e 
1 
= a2 C 
(a)_[ (sind = cose) )d@ = — cos 0 — sin 0 + c. 
=] 
o Jz 773 dr = tan r+c. 
—3 
© fy Ty te u = —3tan ute. 
© | aat = 5 tan r+e. 


(h) [H aeae 


(3z + 2°) +c. 


ale 


Solution to Activity 20 


D o 


-Jz as “fia 


= 31n |z| — tan™t x + c. 


(b) J (cosec x)(cosec x + cot x) dx 
= (cosec” x + cosec x cot x) da 


= foose xdr + f cosec x cot x dx 


= — cot x — cosec x + c. 
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Unit 8 Integration methods 


Solution to Activity 21 


d 
(a) Let u = cos x; then T Z singz. So 
dz 
fe sina) de= f e" du 
=e"+c 
= eon + C. 


d 
(b) Let u = z3; then T = 3x?. So 


| e de = / aida 
=—cosu+c 


= —cos(x*) + c. 


d 
(c) Let u = sin z; then 7 = cos T. So 
T 


1 
I(s )coszde= f <a 
sin £ u 


=In|u| +c 


= ln |sin z| + c. 


d 
(d) Let u = cos x; then T = — sinz. So 


l | 
s e 
Ù 
A 
Q 


Il 
| 
g 
l 
= 
+ 
QO 


cos £ 
= — sec T +c. 

(A quicker way to find this indefinite integral, 

without using substitution, is to write the 

integrand as — sec x tan and use the table of 

standard indefinite integrals.) 


du 
(e) Let u = sin z; then — = cosx. So 


dx 


fsm zcoszaz= fut du 


=, 6 
=u +c 


z Esin” x +c. 
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(f) Let u = sin z; then i = cosx. So 
z 


1 1 
I) ossds= f (5) du 


= tanlu +c 


= tan ‘(sinz) +c. 
Solution to Activity 22 


d 
(a) Let u = 4 + cos x; then Z 


k —sinz. So 
x 
fatos sin x) dz = fra 
= sue +c 
= 4 (4 + cos z)? + c. 


(b) Let u = 1 + 2?; then = = 27. So 


[VFF Qaar u 


= [wPau 


_ 1 ar 
= 372" +c 


= F(1 +r e. 


(c) Let u = x° — 8; then Le ba. So 
dx 


= au +C 


d 
(d) Let u = e” +5; then T =e? 0 


1 
I( Jerar= f au 
e? +5 u 


=In|u| +c 
=Inle’ + 5| +c 
= ln(e” + 5) + c. 


d 
(e) Let u = 5 + 22°; then = = 6x?. So 
x 


f (sin(5 + 2x°)) (6x7) dz = [sud 
= — cosu +c 
= — cos(5 + 22°) + c. 


Solution to Activity 23 


d 
(a) Let u = e”; then <= So 


dx 
J costen))e* dt = [cosuau 
=sinu +c 
= sin(e”) + c. 


d 
(b) Let u = 1 + sin z; then 7 = cos x. So 
hi 


paaa dx = fe du 


=e"+ce 
= eitsin x +e. 


d 
(c) Let u = x1? + 6; then T = 107°. So 


I (a5) (102°) dz = | Zau 


=In|u| +e 
= In|z'° + 6| +c 


= ln(z? +6) + c. 


d 
(d) Let u = cos z; then a 


o — sing. So 
x 
J cos? x)(— sin x) dz = fe du 

= tuf +e 

= $ cos* x + c. 
Solution to Activity 24 

d 
(a) Let u = z3; then = 3x". So 
x 


T7 cos(x?) dz = I (cos(x?)) z? dz 
= } f (cos(e*)) (802) a 


= 1 foosudu 


Solutions to activities 


(b) Let u = 1 + 27; then Oe 2x. So 
dg 
Jev TF a= | (VTF) zaz 
=; f 
2 
=; f 
2 
=; f 
2 


=i 
=A 


(c) Let u = cos z; then 


Su 
dx 


(vi + z2) x Qa dg 
vudu 


u? du 


2y3/? +e 


lu? +c 


(1+ eh c. 


— sing. So 


f cos! zsinzaz = = ficos x)(—sin x) dx 


=- fut 


pe Or 
=— 5u FE 


Solution to Activity 


1 


25 


cos? 2: TAGs 


d 
(a) Let u = 1 + 227; then 7 = 4g. So 
x 


z 
|rt 


IG 


¢In|ul +c 


¢In|1 


+In(1 


+ 2z7|+¢ 
+ 227) +c 


(since 1 + 2x? is always positive). 
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Unit 8 


(b) 


Integration methods 


d 
Let u = 2 + e”; then U = et, So 


dz 


d 
(c) Let u = 3 — sin z; then = = — cosg. So 
z£ 


cosg 
—— daz 
3— sing 


3 
1 
=- f cau 
u 
= —ln|u| +c 


= —ln |3 — sin z| + c 
ln(3 


sing) +c 


(since 3 — sin z is always positive). 


Solution to Activity 26 


(a) 


du f 
Let u = cos x; then — = — sing. So 


|e ined - f e*(~sinz) dx 


dx 


=- feau 


= —e" +c 
== E T 


du 


(b) Let u = 3 — e”; then — = —e”. So 
dz 
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ev 
d 
i á 


2 -J (=) (—e")dz 


1 
=- f Zau 
u 
= —ln |u| +c 
= —ln|3 — e| + c. 


=- | (5) Cona fimzaæ= | 


(c) Let u = 1 + į sin z; then 


for $ sing 


du 


dg 


)? cos z dx 


= 2 fa + į sin z)? (4 cos x) dx 


=2 [au 


=2x ju? +c 


25.223 
= 3uU +c 


= 2(1+ $sinz)* +c. 


Solution to Activity 27 


du i 
Let u = cos x; then — = — sin x. So 
dx 


sin £ 


COS £ 


= f (=) Coe 


= —l]ln 


= —l]n 
Solution to Activi 


(a) Let u = 6x — 1; 


oa dx = 


Jul +e 
| cos z| + c. 


ty 28 


du 
hen — = 6. 
ien i 6. So 


p fem x 6dx 
feau 


ev +e 


ert Le 


ole Ole Ole 


(b) Let u = 42; then Ta 4. So 
dx 


f sin(4x) dz 


= + finta) x 4dg 


= + | sinudu 


= 4(—cosu) +¢ 
z= —4cosu+c 


= —{ cos(4x) + c. 


=i 
=9%9 COS T. 


So 


Solutions to activities 


(c) Let u = —9za; then am —9. So (g) Let u = 2x + 1; then “uo 2. So 
dx dx 


fetrar=—3 f e x (-9)ae [ar e) x 2dz 


atk u 1 
= ife du =} f Sau 
1, u 
=-5e tce E: =2 
Lleo =4 fu du 
5 . 
É =l 
d =S—sl ac 
(d) Let u = —2/3; then = = —4, So _ A ` 1 ip 
=—5 
fev®ar=-a fe x (—4) dr A 
s) 7 —" 9g) 
= —3 | e“du 
d 
(h) Let u = 1 — z; then es E E T 
= —3e"+e dx 
— _9,-2/3 il 1 
= —3e j +c. [e-l] x (—1)dz 
(e) Let u = 3 — Tz; then — = —7. So 1 
dz =- |5 
u 
[costs — 7x) dz = —4 [costs — 7x)) x (—7) da 
=- fuau 
=l 
= b f cosudu s a 
= -sinu +c ERT 
= —4 sin(3 — 7x) + c. a 
= Le 
(f) Let u = 4 — z; then SY = -1 So 2(1 — x)? 
z 
du 
il 1 A PEEN ere 
[es] (Fe) «oe (i) Let u = 6x; then — = 6. So 
E f: d f seee) dz = £ | (sec*(62) x 6dr 
=—In|ul +c =} sec? udu 
= —lnj4— z| + c. = {tanu +c 


= ł tan(6x) + c. 
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Unit 8 Integration methods 


Solution to Activity 29 


du 
Let u = 2a; then — = 2. S 
(a) Let u x; en a, o 


Jae] aer 


ii 
a ete 


= sin tu +c 


= sin (2x) + c. 


x 2dr 


(b) Let u = v2 z; then = = V2. So 
£ 


1 1 1 
oe = | — Vd 
ra v2) 14 (V2z)° 
1 1 
-z| an 


1 —1 
= — tan ` u+c 


v2 


= ga (vas) Fe: 


I 
Ie 
x 
wile 
+ 
mm] BH 
NIW 
8 
NYS 
N 
x 
NIW 
Q 
8 


ale 
—, —, 

— 
pe 

— 

NIW 
8 

“—" 

N 


Ale OIF ole 


Solution to Activity 30 


(a) f osuo) dx = 4 sin(10x) + c. 


(b) f soe- 52) dz = —#(— cos(2 — 5x)) + c€ 


= t cos(2 — 5x) + c. 
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(c) e d= zet4 HE 
(a) f E = 2ain(26) +c. 
(e) je dp=—je +c. 


1 1 


dr = -ln |1 -r| + c. 
l-r 


J 
(h) [ses — 1) dz = $ tan(5x — 1) + c. 
J 


(j) fe dz = -e +c. 
(k) Jor da = 2e"/* + c. 


(1) [sore — 3p) dp = —4 tan(2 — 3p) + c 


= į tan(3p — 2) + c. 
(The final expression was obtained by using the 
trigonometric identity tan(—@) = — tan 0.) 


(m) [sin (=) de= f sin (3 — 40) dz 


=—3 (— cos (2 = ix)) +e 


2 = 
= 3cos ( =) +6 


Solution to Activity 31 


(a) The velocity v (in ms~') of the object at time t 
(in seconds) is given by 
v = 2 cos (5t +5) : 
Hence the displacement s (in m) of the object 
at time t (in seconds) is given by 


s= [200s (dt +5) dt 
=2 x 2sin (št +5) +c 
= 4sin (4t +5) + c, 
where c is an arbitrary constant. 


When t = 0, s = 4. Hence 

4=4sin($ x0+5)+c 

4 = 4sin(5) + c 

c = 4 — 4sin(5) 

c= 7.835... 

c = 7.84 (to 2 d.p.). 
So the equation for the displacement of the 
object in terms of time is 


s = 4sin (4t + 5) + 7.84. 
(b) When t = 10, 
s = 4sin (5 x 10 +5) + 7.835... 
= 4sin(10) + 7.835... 
= 5.7 (to 2 s.f.). 


So the displacement of the object at time 
10 seconds is 5.7m (to 2 s.f.). 


Solution to Activity 32 
d 

(a) Let u = 1 + 227; then oT = 4e. 
dz 


Putting x = 0 gives u = 1 and putting x = 1 
gives u = 3. So 


1 1 
— es ae 
f a= / (a) Uovae 


= ;(In3 — ln 1) 
= 7ln3 
= 0.275 (to 3 s.f.). 


Solutions to activities 


d 
(b) Let u = cos z; then T ina: Putting 


x 
x = 7/2 gives u = 0 and putting x = 7 gives 
u = —1. So 


T T 
f cos? z sin z dz = — f cos? z(— sin x) dx 
T/2 T/2 


II 
l 
z 
a 
g 
w 
Qa 
= 


Ale Ble mo 


| 
él 


= —0.25. 


(Notice that in the second line of the 
manipulation above we obtain an integral, 

i u®, in which the lower limit of integration is 
greater than the upper limit of integration. An 
alternative way to proceed from this line is to 
use = fact that, in general, 

KI rjdg= =i f(x) dx. Then the 

AS Sorte ie like this: 


0 
_ [1,4 
= [aet], 
0 
= rikoa 
a (=(=) 
E 
4 
= —0.25.) 
Let u = —32?; then ae = — 6r. 
dx 


Putting x = —1 gives u = —3 and putting x = 0 
gives u = 0. So 


0 F 0 : 
[oe dr= af aa ) (—6x) da 


=-§le 

=A (le) 
sae =I) 
—0.158 (to 3 s.f.). 
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Unit 8 Integration methods 


Solution to Activity 33 Hence 
(a) Let u = 1 + z; then = = 1. Also, r = u — 1. So Ja da = froo dx 
x 
[eviFeae= fu- Vidu = f@)C@)- | f"(e)G(a) az 
= f (e-u) du =e- | eas 
u5/2  u8/2 = re — e +c 
eg a re = e”(2 — 1) + 
5/2 3/2 e*(a c. 


= 2(1+2)°/? — 2(1 +2)? +c. Solution to Activity 35 


du 
(b) Let u = x + 3; then Ep = 1. Also, rx = u — 3, so (a) f esinta) dz =x (—+ cos(52)) 
x—2=u-—D5. Hence 


[ee [Saw - [ 1x (-$eos(62)) dx 
(x +3)’ u3 
= [we — 5u73) du = —tacos(5x) + $ foose) dx 
— _ynl 4 by? 4 ¢ = —ixcos(5r) + 4 x łsin(5x) +c 
1 z = —4r cos(5x) + $ sin(5z) + c 
sa a = 4 sin(5x) — tz cos(52) + c. 
1 5 
Spp ertt œ) fae de=a ($e) - fix ($e) de 
1 2T Al 2% 
; aT —i — 1 d 
Solution to Activity 34 ane 2 Je = 
(a) Let f(x) =x and g(x) = cosx. Then f'(x)=1 = tre" — i x fe +c 
and an antiderivative of g(x) is G(x) = sin z. = fre” — fe** +c 
Hence — 162" (2z Nee 
[ veose az = | t@s@) dx 


= f(z)G(x) — fræse) dz ee + fer da 


=asine— | sinedr = —ge™® + (-e-*) +e 


= rsin g — (— cosx) + c -zre -e+e 


= xsin g + cos T + c. = —e "(x +1) +c. 


(b) Let f(z) = x and g(x) = e”. Then f’(z) =1 
and an antiderivative of g(x) is G(x) 


=e 
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(c) f aeea f 1 x (-e7*) dz 


Solution to Activity 36 


(a) [oe dx = ae” — [pe dx 
= re” — 2 [er dg 
= r’e" —2 (2e - fi x de) 
= ge” — 2ge” + 2 fe dx 


= ge" — 2ge” + 2e” +c 
= e" (x? — 2g +2) + c. 


(b) I x?’ cos(2x) dx 
x’ (4 sin(2x)) — fag sin(2x)) da 


dx” sin(2r) — f esin(2r) dz 


2 


$27 sin(2x) 


— (24 cosa) - fi 4 (—} eos(20)) de) 


$x° sin(2x) + 42 cos(2x) — z f cos(a) dz 


= $2” sin(2x) + 4x cos(2x) — 4 x $ sin(2x) + c 


1 
2 
1 (2x? — 1) sin(2x) + $x cos(2z) + c. 


Solution to Activity 37 
fe +2)e 5" dz 


(5x + 2) (=) eer f» (=) e "de 


= —1(5a + 2)e 7” + fe da 
= -i( 


1 
5 
E(5x + 2)e 5" — ter dxz+c 
—te "(5x +241) +¢ 
=—#(5r+3)e°" +. 


(An alternative way to find this integral is to start 
by writing it as 


am dz+2 f e= dz. 


The first integral in this expression has an integrand 
of the form zg(x), where g(x) is an expression that 
you can integrate, so you can find this integral by 
using integration by parts. The second integral is 
straightforward to find.) 


Solutions to activities 


Solution to Activity 38 

(a) a hicde = [on dx 

fat fF x dat dx 
4 Par 


1,4 1 3 
zz ine} fa dx 


= (ln x) x 


= tztlnz — - x qui +c 
= par (4lng — 1) +c. 


(b) [mea = fone x ldz 
(naje f= x zdz 


zmz- [ 1de 


glnia—=i1--¢ 
x(Inz—1)+e. 


Solution to Activity 39 
fe — 2) In(3x) dx 
= [wee — 2) dz 


ii 
= In(3a) x ($2? — 2x) — - ($a? — 2a) dx 


J 


Le(x — 4) In(3x) — [Ge 


= $2(x — 4) In(32) — 4 fe — 4) dx 
= $a(x — 4) In(3x) — $($27 — 42) +c 
= $2(x — 4) In(3x) — 4z(x — 8) + c 
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Solution to Activity 40 
(a) We write 


fe sin(3x) dz = f sinea) e” dz. 


Integrating by parts twice gives 
f soea) e” dz 
= sin(3x)e” — fe cos(3x))e” da 


= sin(3x)e” — 3 / cos(3x)e* da 


= sin(3x)e” 


-3 (cos(3a)e" 7 f (—3sin(3x))e* ar) 


= sin(3x)e” — 3 cos(3x)e” — 9 f sin(3x)e” da. 


So 
10 [ sin(3a) e” dz 


= sin(3x)e” — 3 cos(3x)e” + c 


and hence 
f soea) e” dz 


= 1 sin(3r)e” — 3 cos(3x)e” + c. 


10 10 
That is, 
Je sin(3ax) dz 
= fe" sin(3x) — $e” cos(3x) + c 


= fe" (sin(3x) — 3cos(3zx)) + c. 
(b) Integrating by parts twice gives 


e2* cos x dx 


= e* sing — fee) sin x dx 
= e°” sing — ae sin z dx 
= e” sinz 


-2 (ec cos £) — i (2c?*)(— cos x) ar) 


= e?” sin z + 2e?” cos z — 4 f e°? cos z da. 
So 


5 | è coszde = e?” sin z + 2e?” cosg +c 
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and hence 
ie cos z dz = te" sin z + 2%" cos x +c. 
That is, 
22 — 1,29 /.: 
fe cos z dz = e” (sin z + 2 cosx) + c. 


Solution to Activity 41 


1 
3 
1 
3 
1 
9 
1 
9 
= 4.5746 (to 4 d.p.). 


Solution to Activity 42 
(a) If x € [7/12, 7/6], then 3a € [rt/4, 7/2]. 
Now 
sinô > 0 when 0 € [7/4, 77/2], 
so 
sin(3x) > 0 when z € [x/12, 7/6]. 
Also 
x > 0 when z € [7/12, 7/6]. 
Hence 
xsin(3xz) > 0 when z € [7/12, 7/6). 
That is, the graph of f(x) = xsin(3z) lies above 
the a-axis for values of x in the interval 
[7/12, 2/6]. 


(b) It follows from part (a) that the area between 
this graph and the z-axis from x = 7/12 to 
x = T/6 is 


T/6 
f x sin(3x) dz. 


/12 


(c) Integrating by parts gives 


T/6 
f x sin(3x) dx 


/12 


= [x(—-4.cos(32))]""" - f METT 


|l 
l 
wl 
=n 
8 
O 
=] 
v 
P 
w 
8 
x 
iL 
3 A 
S 
e O 
N 
+ 
wl= 
R 
~ a 
= ae 
N fon) 
O 
O 
n 
— 
w 
8 
< 
em 
8 


(ala) - la) 


Eran -sim(3)) 


ee eee 
36V2 *\ v2 
= 0.0943 (to 4 d.p.). 
(The area found in this solution is shown below.) 


y = xsin(3z) 


Solution to Activity 43 


(a) To help us construct a table of signs for the 
function f(x) = (x — 2)(e~” — 1), first we 
consider the two factors of the expression 
(a — 2)(e~* — 1) separately. 


The function y = x — 2 is increasing on its 


whole domain, and the x-intercept of its graph 


is © = 2. 


aN 
A 


Solutions to activities 


The function y = e~* is decreasing on its whole 
domain, so the function y = e~* — 1 is also 
decreasing on its whole domain. The x-intercept 
of its graph is given by e~* — 1 = 0, which gives 
eo = 
—z=Inl 
=% = 0 
g=. 
So its x-intercept is z = 0. 
So we have the following table of signs, for 
x > 0. 


The table shows that the graph of the function 
f(x) = (a — 2)(e~* — 1) lies above the z-axis for 
0 <a < 2 and below the x-axis for x > 2. 


Because the graph of f lies above the x-axis for 
0 <a < 2 and below the z-axis for x > 2, to 
find the total area between the graph of f and 
the x-axis from x = 0 to x = 4 we have to work 
out the signed area from x = 0 to x = 2 and the 
signed area from x = 2 to x = 4 separately. 


The signed area between the graph of f and the 
z-axis from x = 0 to x = 2 is given by 


fe — 2)(e"® —1)dz 


= 2(—1) + |-e* + ral 
= —2 + (—e7? + 2) — (-e~° + 0) 
S=J=¢ 74241 
=1l-e?. 
Since the graph of f lies above the x-axis for 
0 <2 < 2, it follows that the area between the 
graph i f and the x-axis from x = 0 to x = 2 is 
l-—e™. 
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Similarly, the signed area between the graph 
of f and the z-axis from x = 2 to x = 4 is given 
by 


fe — 2)(e"® — 1) dz 


= —2e-+ 8+ | e "4 ja? 
2 
=e — 8 ie + 8) — (—e7? + 2) 
-3e + e? -2. 


Since the graph of f lies below the x-axis for 
2 < x < 4, it follows that the area between the 
graph of f and the z-axis from x = 2 tox = 4 is 


(ae se? a E eA, 
So the total area between the graph of f and 
the z-axis from x = 0 to x = 4 is 

(1 — e7?) + (3e74 — e7? + 2) 

= 3-2e7? + 3e74 

= 2.784 (to 4 s.f.). 


(The area found in this solution is shown below.) 


Solution to Activity 44 
(a) 7 cos? 0 d0 = f (1+ cos(20)) dé 


z sfa + cos(26)) 40 


$(0+ 4sin(20)) +c 
= 4(26 + sin(26)) + c. 
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b sing cosrdxz = | 4sin(2xr) dx 
(b) 2 
= $ x $(—cos(2z)) + c 


= —4 cos(2x) + c. 


Solution to Activity 45 


du 
Let u = sin z; then — = cos g. So 
daz 


[one cosrde = f udu 


= sur FÈ 
= i sin? z + c. 
(There’s an alternative approach to finding this 


integral, as follows. Let u = cos x; then 


du . 
— = — sinz. So 


dg 
[one cosrda = | cosrsine dx 


__ f L T 


=- Judu 


= —$u? FE 

= —4 cos? g + c. 
This approach gives a different answer, but of 
course it’s equivalent to the first answer. You can 
confirm this by using the identity 
sin? 0 + cos? 0 = 1, as follows: 


ae ee! 2 
= 60s z +c = —5(l— sinf s) ae 
= lány i 
= 57sin"@—5 te 
= {sinņ°z +d, 
where d = —4 + c is an arbitrary constant. 


Notice also that the solution to Activity 44(b) gives 
a third different answer: 


[one cos x dx = —}cos(2zx) + c. 


You can confirm that this answer is equivalent to 
the first answer above by using the identity 
cos(20) = 1 — 2sin? 0, as follows: 
—4{ cos(2x) +c = —}(1 — 2sin? x) + c 
= i sin? z — + +c 
= i sin? z + d, 


where d = —} +c is an arbitrary constant.) 


(The diagram below shows the graphs of the three 
different antiderivatives of f(x) = sin x cosx that 
were found in the solutions to Activity 44(b) and 
Activity 45. The arbitrary constant has been taken 
to be zero in each case. You can see that, as you’d 
expect, the three graphs appear to be vertical 
translations of each other.) 


Solution to Activity 46 


(a) Since the derivative of 9x4 is 362°, which is 
‘nearly’ x, you can use the substitution 
u = 9x". 


d 
Let u = 92+; then Z. = 3623. So 
dx 


36 cos(9a*) x 3623 dx 


i 
5 f cosudu 


=d a 
= 3g SInu +c 


= 4 sin(9x4) + c. 


fe cos(9x*) dz = + 


(b) The integrand is of the form xg(x), where g is a 
function that you can integrate, which suggests 
that you should try integration by parts. To 
integrate the ‘second’ expression, you can use 
the fact that it is a simple function of a linear 
expression. 


f cos(x) dx = $z sin(5x) — 2 f sm(52) dx 


= tr sin(5x) — 4 (—#cos(5z)) + c 


= trz sin(5x) + + cos(5z) + c. 


(c) The integrand is the sum of two expressions 
that are straightforward to integrate. The 


Solutions to activities 


second expression is a simple function of a 
linear expression. 


fe + cos(5a)) dz = $27 + t sin(5x) + c. 


(d) You could use integration by substitution 
(taking u = x? + 3) or integration by parts, but 
the simplest method is to start by multiplying 
out the brackets. 


Ja +3)dz 


[Go +32) ae 

ee ee 

= G0 +5% +c 

= 4x? (£? +6) +e. 


(e) The integrand can be written as a 
1+ (v3 x) 

so it’s a function of the linear expression v3 z. 
So you can use the substitution u = V3 x, 


1 
together with the indefinite integral of ——, 
14+ 2? 
which is a standard integral. 


Let u = v3 z; then du/dx = V3. So 


1 1 1 
eee ae ae pune 


e i 4 
(3 i+w 


1 =j 
= — tan u+c 


v3 


1 
= — tan”! (v3z) +c. 
V3 
(£) Since the derivative of 7 — x? is —3x?, which is 
‘nearly’ the numerator x7, you can use the 

substitution u = 7 — 2°. 


d 
Let u = 7 — x; then AU = —322. So 
dz 


| 3 | veo 
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(g) The integrand is of the form xg(x), where g is a 
function that you can integrate, which suggests 


that you should try integration by parts. To 
integrate the ‘second’ expression, you can use 
the fact that it’s a simple function of a linear 
expression. 


[oor daz 
= (—3e-*/9) — E x (—3e=/) dz 


= —3ge 77/3 + gje dz 
= —3re~*/3 — ge“*/3 +c 
3e? (x +3) +c. 


(h) You can use the trigonometric identity 
sin(20) = 2 sin 0 cos 0. 


[sm ($2) cos ($2) dx = fisnzar 


= 5 fsmzar 


= —4 cosg + c. 


(i) You can use the fact that the integrand is a 
simple function of a linear expression, or you 
can use the substitution u = x — 1. 


fe- nta = e 


(j) You could use integration by substitution 
(taking u = 1 — 2e”), but the simplest method 
is to start by multiplying out the brackets. 


fea 96") de = fe — 2(e”)?) da 
je- 


| 
a 
8 
| 
bo 
x 
IR 
a 
N 
È 
EA 
o 


| 
a 
R 
| 
a 
bo 
8 
ae 
o 


= e*(1—e*) +c. 
(Here’s how the integration goes if you use 
integration by substitution. 
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Let u = 1 — 2e*; then ou = —2e”. So 


dx 
[ea-2")ar = sfa 2e*)(—2e) dx 


= —7(1 — 2e”)? +e 
= —7(1 — de® + 4(e*)*) +e 
= —4(1 — 4e” + de") +c 


= e”(1 — e) +d, 
where d = —4 + c is an arbitrary constant.) 


(k) You can start by multiplying out the brackets 
and applying the sum rule. Then you have two 
integrals to find. One of them is 


straightforward, and you can find the other by 


using integration by substitution, taking u = 2”. 


f aitanta jdr 
Sh + asin(x”)) dx 


= fear+ f esin(a)d 


= $x? + Sethe. 


Let u = x°; then du/dx = 2x. The expression 
above becomes 


ir? +4 [tin(e?)) x 2rdr 
= ir? +} fsinudu 

= $27 + $(—cosu) +c 

z? — 4 cos(x?) +c 


(xz? — cos(z”)) + c. 


(1) Since the derivative of 32? — 2 is 6x, which is 
‘nearly’ x, you can use the substitution 
u = 3z? — 2. 


d 
Let u = 3x7 — 2; then Tu = 62. So 
dg 


feer — 2)? dz = ifer — 2)’ x 6x dz 


= 4 (827 — 2)? + c. 


Solution to Activity 48 


(a) The domain of f is the interval [—1, co) 
(because the expression x + 1 is defined only 
when z is in this interval). 


(b) The graph of f is shown below. 


YA 
Nee _ 2? -10r +15 
1 Ea 
154 
1 
54 

J T 


(c) The z-intercepts of f are 1.84 and 8.16 (to 
3 s.f.). 


(The exact values are 5 + v10.) 
(d) The area between the graph of f and the 
z-axis, between the two x-intercepts, is 
8.162... 52 _ 107 + 15 
E A Tae yan seal = 


A computer gives the value of this expression as 
12.4 (to 3 s.f.). 


(Details of how to use the CAS for this activity 
are given in the Computer algebra guide, in the 


section ‘Computer methods for CAS activities 
in Books A—D’.) 


Solutions to activities 


Solution to Activity 49 


The area of the cross-section, in square centimetres, 


is 
1.5 
f iy zr? + 1dr. 
=| 
A computer gives the value of this definite integral 
as 0.939 (to 3 s.f.). So the volume of plastic 
required to make one metre of the edging is 
approximately 93.9 cm. 

(Details of how to use the CAS to find the indefinite 
integral are given in the Computer algebra guide, in 


the section ‘Computer methods for CAS activities 
in Books A—D’.) 


Solution to Activity 50 


The change in the displacement of the object from 
time t = 0 to time t = 10 is 


10 #2 
in | — | dt. 
f (5) 


A computer gives the value of this definite integral 
as 2.15 (to 2 d.p.). 

So the displacement of the object at time t = 10 is 
(6 + 2.15)m = 8.15 m, to the nearest centimetre. 
(Details of how to use the CAS to find the indefinite 
integral are given in the Computer algebra guide, in 


the section ‘Computer methods for CAS activities 
in Books A—D’.) 
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Unit 9 


Matrices 


Introduction 


This unit introduces mathematical objects called matrices (the singular 
form is matriz). At their simplest, matrices are arrays of numbers used to 
hold information in an orderly way. In particular, they provide a useful 
way of storing data that can be organised in rows and columns. Everyday 
examples of such data include transport timetables and tables holding 
survey results. 


In this unit you will discover that arrays of numbers can be thought of as 
‘multi-dimensional numbers’. For example, you can add, subtract and 
multiply them, just as you can add, subtract and multiply numbers. 
You’ve met a similar idea before: in Unit 5, you saw that column vectors 
are numerical entities made up of more than one number, and they can be 
added, subtracted and multiplied by scalars, for example. 


One advantage of arranging large amounts of numerical data in matrix 
form is precisely that a block of data can be treated as a single entity. 
Many problems that involve large amounts of numerical data are first 
reduced to matrix problems before being solved with a computer. So 
matrices are an important topic in mathematics, and find widespread use 
in mathematical modelling, as well as in computer graphics (where they 
are used to move and rotate objects), medical imaging, economic models, 
electrical networks, data encryption and many other applications. 


As a simple practical example, consider a publisher that supplies its books 
to a number of bookshops. The publisher’s sales department might collect 
the weekly order quantities for three of its bestselling books in a grid such 
as the one in Table 1. 


Table 1 A publisher’s order quantities for three books in a certain week 


Bestseller 1 | Bestseller 2 | Bestseller 3 
Bookshop 1 
Bookshop 2 
Bookshop 3 


Bookshop 4 


The information in Table 1 can be stored more succinctly by omitting the 
row and column headings and enclosing the data in brackets, if it is 
understood that each row corresponds to a bookshop and each column 
corresponds to a book: 


10 25 12 
5 10 5 
7 58 O 
10 8 10 


This is an example of a matrix. Information about order quantities can be 
collected every week for the life of the three books and stored in this form. 


Introduction 


A magic square (an example 
of an array of numbers) on 
the facade of the Sagrada 
Familia basilica in Barcelona 
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Pages from The nine chapters 


on the mathematical art 
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For example, the information for two consecutive weeks can be recorded in 
two separate matrices: 


10 25 12 15 30 20 
5 10 5 10 20 10 
7 5 0f’ 20 8 5 
10 8 10 15 0 10 


The order quantities for a fortnight can now be calculated by adding 
corresponding numbers in the two arrays, which gives 


10+15 25+30 12+ 20 25 55 32 
5+10 10+20 5+10 that is 15 30 15 
7+20 5+8 0+5 [° , 27 13 5 
10+15 8+0 10+10 25 8 20 


This calculation amounts to adding the two matrices, in a sense that will 
be made precise in Section 1. 


You might wonder what the advantage is in carrying out calculations in 
this way. After all, the total number of orders for each bookshop can be 
worked out in a straightforward way without resorting to matrices. In a 
real-life situation, however, a large publisher will supply hundreds of 
bookshops with hundreds of books. Calculating order quantities and total 
charges requires careful accounting, and it is precisely in this sort of 
situation that matrices become a useful way to store data (especially 
electronically) and carry out calculations with them. 


Matrices were first introduced in Western mathematics in their 
current form by the British mathematician Arthur Cayley 
(1821-1895). However, they appeared as early as 300-200 BC in the 
Chinese text The nine chapters on the mathematical art, where 
matrix methods are used to solve simultaneous equations. You'll meet 
a modern version of these methods in Section 4. 


There are other useful operations that can be carried out when data are 
stored in matrix form, such as the multiplication of matrices by single 
numbers, and the multiplication of two matrices. You’ll learn about these 
operations in Section 1, and see an application of matrix multiplication in 
Section 2. In Section 3 you’ll learn about matrix inverses, and in Section 4 
you'll meet a connection between matrices and systems of linear equations. 


1 Matrices and matrix operations 


1 Matrices and matrix operations 


In this section you'll meet matrices and standard matrix notation, and 
learn about matrix addition, multiplication of matrices by single numbers 
(known as scalar multiplication of matrices), matrix multiplication and 
matrix powers. 


1.1 Matrix notation and terminology 


A matrix (pronounced ‘may-tricks’) is a rectangular array of numbers, 
usually enclosed in brackets. Here are some examples: 


t 4 

e 2 6) € J A 

53 0 3 1 92 0 
Some texts use square brackets for matrices, like this: 


5 a} 


In this module we’ll always use round brackets. 


We often use capital letters to denote matrices. For example, we might 
write 


1 
z 4 ' ; 
1 2 —1 1 2 3 First rule of consulting: never, 
i (5 3 i) f= € i) ek eee never call this a “table”. 
22 0 We can bill four times as much 


In the text in this module, the capital letters that denote matrices are in when we call it a “matrix”. 


bold font, but you don’t need to do anything special when you handwrite 
them. In particular, you don’t need to underline them, unlike letters that 
denote vectors. 


The numbers in a matrix are called its elements (or its entries, in some 
texts). For example, matrices A and C above each have six elements, and 
matrix B has four elements. In this module, the elements of matrices are 

always real numbers. 


A horizontal line of numbers in a matrix is called a row, and a vertical line 
of numbers is called a column. For instance, in the matrix A above the 
first row is (1 2 —1) and the second row is (5 3 0). Likewise, the first 


column is G) the second column is (5) and the third column is E 


The matrix A has two rows and three columns: we say that it is a 2 x 3 
matrix. In general, a matrix with m rows and n columns is described as an 
m x n matrix, or a matrix of size m x n. The first number in this 
notation is always the number of rows, and the second number is always 
the number of columns. The size m x n of a matrix is read as ‘m by n’. 


A matrix with the same number of rows as columns is called a square 
matrix. Thus, for example, matrix B above is square. 
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The column notation for vectors that you met in Unit 5 is a particular 
instance of matrix notation: two-dimensional column vectors are 2 x 1 
matrices, and three-dimensional column vectors are 3 x 1 matrices. In 
general, the word vector is used to mean any matrix with a single column, 
even if it has more than 3 elements and hence has no geometric 
interpretation in the plane or three-dimensional space. For instance, the 
following matrices are vectors: 


, where w, x, y, z are variables. 


reese 


The elements of a vector are often called its components, as in Unit 5. 
A vector with n components is called an n-dimensional vector. 


There is a useful notation for the elements of a matrix in terms of their 
row and column positions. The element in row i and column j of a 
matrix A is denoted by a;;. Similarly, the element in row 7 and column j 
of a matrix B is denoted by b;j, and so on. 


For instance, if 


I 
a=(3 3 
7 -6}’ 


then ay, = $, a12 = 4, a21 = 7 and a22 = —6. 


This notation is used for matrices of any size. Thus, for instance, a general 
3 x 4 matrix can be denoted by 


a11 412 Q13 Q14 
A= | a2 a2 a23 a24 
a31 Q32 Q33 434 


Activity 1 Using matrix notation 


Write down the elements a33, 434, 443, G14, G44 and agg of the matrix 


2 8 0 =l 


4 7 5 -2 
A= 9 -1 3 —4 
6 —5 =7 =9 


What is the size of A? 


Two matrices are equal if they have the same numbers of rows and 
columns, and all corresponding elements are equal. For example, 


G i)=G 4) 


1 Matrices and matrix operations 


but 
1 2 
1 2 1 2 1 3 
Gapa a (3 aea) 


You can carry out operations on matrices in a similar way to numbers. For 
example, you can add, subtract and multiply them, as you'll see in this 
section. There’s also a way to manipulate matrices that in certain 
circumstances plays the role of division, which you'll learn about in 
Section 3. While matrix addition is similar to addition of vectors, which 
you met in Unit 5, matrix multiplication is more surprising. We’ll start by 
looking at matrix addition. 


Arthur Cayley gave the rules for matrix operations in a paper 
presented to the Royal Society in 1858. The paper also contained the 
conditions under which a matrix has an inverse (you'll meet matrix 
inverses in Section 3). However, it was Cayley’s friend and fellow 
mathematician James Joseph Sylvester who coined the term ‘matrix’, 
in 1850, and he also contributed to the theory of matrices. 


Before being appointed to a mathematics professorship at the 
University of Cambridge, Cayley worked for 14 years as a lawyer in 
London. At the time, Sylvester was employed as an actuary. The two 
worked together at the courts of Lincoln’s Inn, which gave them the 
opportunity to discuss mathematics. 


Sylvester had a more varied mathematical career than Cayley, 
including professorships at University College London, Johns Hopkins 
University in the USA, and the University of Oxford. It is said that 
he tutored Florence Nightingale in mathematics, though firm evidence 
for this has yet to be found. 


Arthur Cayley 
(1821-1895) 


1.2 Matrix addition and subtraction 


You can add two matrices only if they have the same size. The sum of two 
matrices is obtained by adding their corresponding elements. For example: 


=). 4 3 1 -2+3 4+1 1 5 
i 5)+(-2 10) =(14+(-2) 5+10]=|-1 15 
0 -3 —6 4 Gita) -3+4 6 1 


The rule for adding two matrices can be expressed formally as follows. 


James Joseph Sylvester 
(1814-1897) 
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Matrix addition 


If A and B are m x n matrices, then A + B is the m x n matrix 
whose element in row 7 and column j is aij + bij. 


Addition of two matrices of different sizes is not defined. 


For instance, you cannot add a 3 x 3 matrix and a 3 x 5 matrix. 


The rule for adding matrices extends to the sum of any number of matrices 
of the same size: you just add corresponding elements. 


Activity 2 Adding matrices 


For each of the pairs of matrices below, work out the matrix sum A + B. 


wan’). 3-69 


In Unit 5, you saw that vector addition shares many properties with the 
addition of numbers. This happens for matrix addition, too. For example, 
you can see from the definition of matrix addition that 


A+B=B+HA, 


for all matrices A and B of the same size. As you saw with vectors, this 
property is expressed by saying that matrix addition is commutative. 


You can also see from the definition of matrix addition that 
(A+B)+C=A+(B+OC), 


for all matrices A, B and C of the same size. In other words, when you 
add three matrices A, B and C of the same size, it doesn’t matter whether 
you add A and B and then add C to the result, or add B and C and then 
add A to the result, as you get the same overall result either way. This 
property is expressed by saying that matrix addition is associative. It 
tells you that you can write the expression 


A+B+C 


without ambiguity: it doesn’t matter whether it means (A + B) + C or 
A + (B + C), as both are equal. 


There are other respects in which matrix operations behave like number 
operations. You will discover more similarities between matrices and 
numbers as you meet other matrix operations. 


1 Matrices and matrix operations 


You can also subtract matrices of the same size by subtracting their 
corresponding elements. For example, 


4 5 8 13 4\ (4-1 5-3 3-4 

7 —6 0) \3 9 1)  \F=-3 -—6-9 0-1 
_ 2 iI 
AA =15 SR 


Here is the formal definition of matrix subtraction. 


Matrix subtraction 


If A and B are m x n matrices, then A — B is the m x n matrix 
whose element in row ¿i and column j is aij — bij. 


Subtraction of two matrices of different sizes is not defined. 


Like the subtraction of numbers, matrix subtraction is neither 
commutative nor associative. For example, it is not true that 

A —B=B- A for all matrices A and B of the same size, as illustrated 
by parts (a) and (b) of the next activity. 


Activity 3 Subtracting matrices 


= 1 4 — 
Let A = Ta ,B= 2 and C = T an . Work out the 
1 0 =f 2 T 0 


following matrices. 
(a) A-B (b) B-A (c) B-C 


Since vectors are single-column matrices, they are added and subtracted 
by adding or subtracting corresponding components, as in the examples 
below. 


1 7 1+7 8 
-8 -2 =ne 5 
Ay + i = 4+ 1 = 5 
1 3 1 3 
3 3 3+5 2 
0 = 04 (-1) il 
10 = 10 — (—2) 12 
a) ft ah) 3+3 Juf 0 
1 ol | te Tel 
= 1 =i=} =2 


In other words, the ideas of addition and subtraction of column vectors 
that you met in Unit 5 extend in the obvious way to vectors with more 
than three components. 
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A matrix each of whose elements is zero is called a zero matrix, and is 
usually denoted by 0. For example, the 2 x 2 zero matrix is 


o= (8 9. 


Also, as you’d expect, if A is any matrix, then the matrix formed by 
changing each element of A to its negative is called the negative of A, 
and denoted by —A. For example, 


0 1\_/0 -i 
—\-2 4) R 4j’ 
It follows from the definitions of matrix addition and subtraction that for 
any matrices A and B of the same size, 
A-B=A+(-B), 


as you’d expect. 


1.3 Scalar multiplication of matrices 


Any matrix can be added to itself, and a matrix sum A + A can be 
written as 2A. Each element in 2A is twice the corresponding element 
in A, as in the following example: 


1 05 1 05 2 0 10 
=) = 5) Sat <8 of] | 3 -4 ö 
S 2 ji s- 2 +t 6 1 2 


2 
Similarly, adding A to itself n times, for any natural number n, gives a 
matrix in which each element is the corresponding element in A multiplied 
by n. 


This idea can be generalised to any real number k, so kA is the matrix 
obtained by multiplying each element of A by k. For example, 


at | = 5x4 4x3 R 3 
2.0) = 5x0 5x (-1) 0-3) 
The matrix kA is called the scalar multiple of the matrix A by the real 


number k, and the operation of forming kA is called scalar 
multiplication. 


Scalar multiplication 


If A is an m x n matrix and k is any real number, then kA is the 
matrix whose element in row 7 and column j is kaij. 


The number k in a scalar multiple kA of a matrix A is sometimes referred 
to as the scalar k in this context, to emphasise that it’s a number rather 
than a matrix. 


1 Matrices and matrix operations 


Activity 4 Multiplying matrices by scalars 


(a) Calculate the following products of scalars and matrices, simplifying 
your answers. 


(i) (3 J (ii) zG 4) (iii) 5 (3 n) 


(b) Simplify each of the following matrices by writing it as a scalar 
multiple of a matrix with integer elements. 


i) 3 Gi) () (iii) & sr) 


Notice that it follows from the definition of scalar multiplication that, for 
any matrix A, 


(-1)A = —A, 
as you’d expect. 


In the next activity you can practise the matrix operations that you’ve 
learned so far. 


Activity 5 Combining matrix addition, subtraction and scalar 
multiplication 


i 12 i 2 0 i 
Let A= | ? ,B=| ? andC=| ? 
4 —2 0 = 9 zi 3 $ 


Where possible, evaluate A — C, B + 2A and C — 2A. 


Here’s a summary of some useful properties of matrix addition and scalar 
multiplication of matrices. As you’d expect (since vectors are single-column 
matrices), these properties are the same as the properties of vector algebra 
that you met in Unit 5, but with vectors replaced by matrices. 
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Properties of matrix addition and scalar multiplication 


The following properties hold for all matrices A, B and C for which 
the sums mentioned are defined, and for all scalars m and n. 


A+B=B+A 
(A+B)+C=A+(B+C) 
A+0=A 

A+(-—A)=0 
m(A+B)=mA+mB 
(m+n)A=mA+nA 
m(nA) = (mn)A 

1A=A 


Sa e tse Ow 


a 


In properties 3 and 4, 0 is the zero matrix of the same size as A.) 


1.4 Matrix multiplication 


In Subsection 1.3 you learned how to multiply a number and a matrix. 
Under certain conditions, it is also possible to multiply two matrices. The 
way that this is done is less obvious than matrix addition or scalar 
multiplication. 


Recall the example in the introduction to this unit, where a matrix was 
used to store the numbers of copies of three bestselling books that four 
bookshops ordered from a publisher in a certain week: 


Bestseller 1 Bestseller 2 Bestseller 3 


Bookshop 1 10 25 12 
Bookshop 2 5 10 5 
Bookshop 3 7 5 0 
Bookshop 4 10 8 10 


Now suppose that the publisher charges the bookshops £4 per copy for 
Bestseller 1, £7 per copy for Bestseller 2 and £9 per copy for Bestseller 3. 
According to the data in the matrix, Bookshop 1 ordered 10 copies of 
Bestseller 1, 25 copies of Bestseller 2 and 12 copies of Bestseller 3. 
Therefore the total amount, in £, that Bookshop 1 owes the publisher for 
that week’s order is 


10x 4+25x 7+12 x 9 = 323. 


The expression on the left of this equation is obtained by multiplying each 
element in the first row of the matrix by the price of the corresponding 
book, and then adding the results. 


1 Matrices and matrix operations 


If we write the prices (in £) for the three bestsellers as a three-dimensional 
vector, that is, as a 3 x 1 matrix, then we can picture this procedure as 
shown in Figure 1. 


gives 10x 4+25x 74+12x9 


Figure 1 The procedure for calculating the amount owed by Bookshop 1 


If we want to calculate the amount owed by Bookshop 2, then we can carry 
out the same procedure with the second row of the matrix, and likewise for 
Bookshops 3 and 4 with the third and fourth rows, respectively. We can 
write the four amounts that we obtain as a four-dimensional vector. The 
procedure described above combines the two matrices 


10 25 12 4 
oS and 7 

7 5 O 9 

10 8 10 

to give 

10x 4+25x7+12x9 323 
5x4+10x7+5x9 _ {| 135 
7x44+5x7+0x9 ~ | 63 
10x4+8x7+10x9 186 


This calculation shows that the amounts owed by the four bookshops are 
£323, £135, £63 and £186, respectively. 


This procedure for combining matrices comes up so often in applications 
that it is used to define matrix multiplication. We say that 


10 25 12 4 323 

5 10 5 135 
the product of 7 5 0 and : is 63 

10 8 10 186 
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Now suppose that as well as calculating the amount owed by each of the 
four bookshops, the publisher also wants to calculate the profit made from 
sales to each individual bookshop. Suppose that the profits made on each 
copy of Bestsellers 1, 2 and 3 are £1, £2 and £2, respectively. You can use 
exactly the same procedure as above to calculate the profit made from the 
sales to each bookshop. You write the profits (in £) made on the three 
books as a three-dimensional vector, and form 


10 25 12 


1 
the product of Ta and 2], 
7T & 0 2 
10 8 10 
which is 
10 x 1+25x2+12x2 84 
5x1+10x2+5x2 |_ [35 
7x1+5x2+0x2 17 
10x1+8x2+10x2 46 


So the profits made from sales to the four bookshops are £84, £35, £17 
and £46, respectively. 


In fact you can view the two matrix multiplications above — the one for the 
amounts owed, and the one for the profits — as a single matrix 
multiplication. To do this, you put the two vectors containing the prices 
and the profits together as a single matrix with two columns, and you put 
the two vectors that contain the results of the calculations together as a 
single matrix with two columns, and say that 


10 25 12 4] 323 84 
5 10 5 135 35 

the product of 7 5 0 and i 63 17l” (1) 
10 8 10 186 46 


This calculation is an example of how matrices are multiplied in general. 
In Section 2 you’ll meet another context that shows that it makes sense to 
multiply matrices in this way. Before that, you’ll see a more detailed 
explanation of the procedure for multiplying matrices, and have a chance 
to practise it yourself. 


Let’s start by considering which pairs of matrices can be multiplied 
together. Remember that the element in the first row and first column of 
the product matrix in calculation (1) above was obtained by multiplying 
each element in the first row of the first matrix by the corresponding 
element in the first column of the second matrix, and adding the results: 


10x 4+25 x 74+12 x9 = 323. 


It was possible to do this because the number of columns in the first matrix 
is the same as the number of rows in the second matrix, so each element in 
the first row of the first matrix has a corresponding element in the first 
column of the second matrix. You can multiply two matrices together only 
if this condition holds. 


1 Matrices and matrix operations 


Thus, you can multiply together a 4 x 3 matrix and a 3 x 2 matrix. On 
the other hand, it is not possible to multiply together a 3 x 4 matrix and a 
2 x 3 matrix. When it is not possible to multiply two matrices, we 
sometimes say that their product is undefined. 


A convenient way to determine whether two matrices, of sizes m x n and 
p x q, say, can be multiplied together is to write their sizes next to each 
other as follows: 


mxn pxq. 


The matrices can be multiplied only if the two numbers in the middle are 
equal; that is, if n = p. If these numbers are equal, then the size of the 
product matrix is given by the remaining numbers. So the product of an 
m x n matrix and an n x q matrix is a matrix of size m x q. 


For example, a 4 x 3 matrix and a 3 x 2 matrix can be multiplied together, 
and the product has size 4 x 2. Here’s a useful way to picture this fact: 


axl 3 | [3 ]x2 gives 4x2. 


Now let’s look at the procedure for multiplying two matrices step by step. 
It’s demonstrated in the next example. 
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@®. To obtain the element in the first row and first column of AB, 
multiply each element in the first row of A by the corresponding 
element in the first column of B, and add the results. ©& 


232 J] 3 im 0 (-2)x1+1x $43 (-1) 
23 4 2 5 2| = 
0-1 0 im 4 
= 
2 


@. To obtain the element in the first row and second column of AB, 
apply the same procedure to the first row of A and the second column 


of B. # 
== 3 1g -4 (-2)x0+1x2+4+3x4 
2 4 -2 $2) = 
0—1 0 -1 8 


@. To obtain the element in the second row and first column of AB, 
apply the same procedure to the second row of A and the first column 


of B. # 
D a 3 W o 2 14 
E E itax tee. Gi 
=r i 4 


@. In general, to obtain the element in the ith row and jth column 
of AB, multiply each element in the ith row of A by the 
corresponding element in the jth row of B, and add the results. © 


2 1 3 170 
2 4 = 53 2 
oa o N a 
-5 14 
= 6 2x0+4x2+(—2)x4 
0x1+(-1)x4+0x(—1) 0x04+(-1)x2+0x4 
9 
-3 14 
={ 6 0 
1 
-3 =9 
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Here’s a summary of the procedure for matrix multiplication. 


Matrix multiplication 


Let A and B be matrices. Then the product matrix AB can be 
formed only if the number of columns of A is equal to the number of 
rows of B. 


If A has size m x n and B has size n x p, then the product AB has 
size m X p. 


The element in row ¿į and column j of the product matrix AB is 
obtained by multiplying each element in the ith row of A by the 
corresponding element in the jth column of B and adding the results. 


In element notation, if ci; denotes the element in the ith row and 
jth column of AB, then 


Cij = airbij + ainda +--+ + ainbnj. 


Figure 2 illustrates how row 7 of a matrix A and column j of a matrix B 
are combined to give the element in row ¿į and column j of the product 
matrix AB. 


column j 


Gg big + izboj +-+ + ie Ong 
Figure 2 The element in row 7 and column j of a product matrix AB 
expressed in terms of elements from A and B 


Matrix multiplication may seem quite complicated at first, but it is a very 
useful technique, and with practice you should become proficient at it. 
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Activity 6 Multiplying matrices 


In each of parts (a)—(e) below, calculate the matrix product if it exists. 


A 23 1 3 
(a) ce ee (b) (1 2 3) (8 
-6 8 7 
2 9 0 10 9 
(c) 7 i 4 1 (a) {8] @ 2 3) 
—6 8/ \-2 7 
EE Fo 
(e) (7 6 1)|1 9 -1 
2 1 3 


Matrix multiplication shares many of the properties of multiplication of 
numbers, as you'll see later in this subsection and later in the unit. 
However, there’s an important difference between the properties for 
matrices and those for numbers, as you can find out in the next activity. 


Activity 7 Investigating matrix multiplication 


(a) Determine whether the products AB and BA are defined for the 
following pairs of matrices. 


(b) For each of parts (a)(i) and (a)(ii), if the products AB and BA are 
both defined, evaluate them. What do you notice? 


Activity 7 illustrates the important fact that matrix multiplication is not 
commutative. First, there are matrices A and B for which the product AB 
exists but the product BA is not defined. Second, even in cases where 
both the products AB and BA are defined, these two products can be 
different matrices. 


In the next activity, you can investigate whether another property that 
holds for multiplication of numbers also holds for multiplication of 
matrices. 


1 Matrices and matrix operations 


Activity 8 /nvestigating matrix multiplication further 


Let A = (-1 0), B= (5 = ond C= (4 S ie 


(a) What are the sizes of the products AB and BC? Calculate these 
matrices. 


(b) Do the products (AB)C and A(BC) exist? If so, calculate them. 
What do you notice? 


In Activity 8 you should have found that, for the particular three matrices 
A,B and C in the activity, (AB)C = A(BC). This does not of course tell 
you that this property holds for matrices in general. However, this 
property does hold in general: whenever A, B and C are matrices such 
that the products AB and BC are defined, we have 


(AB)C = A(BC). 


In other words, matrix multiplication is associative. In this respect matrix 
multiplication behaves like multiplication of numbers. 


In the next activity you can investigate how matrix multiplication 
interacts with multiplication by a scalar. 


Activity 9 Combining matrix multiplication and multiplication by a 
scalar 


1 2 1 0 
Let A = é 7 and B = € a Show that 


A(2B) = 2(AB). 


In Activity 9 you verified a particular instance of a general property of 
matrices: if the matrix product AB exists, then, for any scalar k, 


A(kB) = (kA)B = k(AB). 


In the next activity you’re asked to investigate how matrix multiplication 
interacts with matrix addition. 
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Activity 10 Combining matrix addition and multiplication 


1 —1 1 —1 
Let A= (3 ee) and C= ( 7 


By calculating A(B + C) and AB + AC, show that 
A(B+C)=AB+AC. 


In Activity 10 you verified a particular instance of another general 
property of matrices: if the matrix product A(B + C) can be formed, then 


A(B+C)=AB+AC. 
It’s also true that if the matrix product (A + B)C can be formed, then 
(A +B)C = AC+BC. 
In other words, matrix multiplication is distributive over matrix addition. 
So, for example, if you encounter a matrix expression of the form 
AB+ AC, 
then you can factorise it to give 


AB+AC=A(B+C). 


Here’s a summary of the properties of matrix multiplication that you’ve 
seen so far. 


Some properties of matrix multiplication 


The following properties hold for all matrices A, B and C for which 
the products and sums mentioned are defined. 


e (AB)C=A(BC) 

e k(AB) = (kA)B = A(kB), for any scalar k 
e A(B+C)=AB+AC 

e (A+B)C=AC+BC 


Remember also the following important fact. 


Matrix multiplication is not commutative 


e There are matrices A, B such that the product AB exists but 
the product BA does not. 


e Even when both products are defined, it can happen that 
AB Z BA. 


1 Matrices and matrix operations 


Matrix powers 


Just as the square a? of a real number a is defined to be the product a x a, 
so we define the square A? of a matrix A to be the matrix product AA, 
if it exists. 


For example, if 


21 
a=(3 a): 


then 


Ba (Dy e 


In the next activity you’re asked to investigate which matrices can be 
squared. 


Activity 11 /nvestigating which matrices can be squared 


(a) Let M = r J . Why is it not possible to form the product MM? 
2 —1 
(b) Let N = (e 3! Is the product NN defined? If, so, evaluate it. 


(c) Describe which matrices can be squared. 


In Activity 11 you saw that if A is a square matrix (a matrix with the 
same number of rows as columns), then its square A? exists. The 
matrix A? has the same size as the original matrix A. 


Be careful not to confuse the two different concepts square matrix and 
square of a matriz. 


If A is a square matrix, then since A? is a square matrix of the same size 
as A, we can also form the products (A?)A and A(A?). Since matrix 
multiplication is associative, we know that 

A’?A = (AA)A = A(AA) = AA?. 


Hence either of these products can be written unambiguously as AAA. 
This matrix is called the cube of A and is denoted by A’. For example, 
for the matrix 


e 


discussed at the beginning of this subsection, we have 


syo aaa ft A DL A07 
aaach a) o) Gi s) 
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In general, for any square matrix A, the nth power of A is 


A” =AA.:-A. 
a 


n times 


Activity 12 Calculating matrix powers 


(a) Calculate the squares of the following matrices: 


(b) Let A = G Calculate AS. 


3 0 
Hint: the cube A? of this matrix A was calculated before the activity. 


Matrix multiplication with a computer 


Matrix multiplication can be time-consuming to carry out when the 
matrices involved are large. In the next activity you can find out how to 
use a computer to carry out matrix operations, including multiplication. 
This work will increase your familiarity with the way matrices behave, and 
help you distinguish between circumstances when it is worth using a 
computer to perform matrix calculations and when it is more efficient to 
do the calculations by hand. 


Activity 13 Working with matrices on a computer 


Work through Subsection 10.1 of the Computer algebra guide. 


2 Matrices and networks 


Matrices can be used to represent and analyse various flows in networks. A 
network is a collection of objects connected by links — either physical or 
abstract — such as cities connected by roads, telephone exchange points 
connected by telephone lines, or people connected by acquaintance. A 
foremost example is the internet, which is a network of communicating 
computers linked by a range of technologies. In this section you'll look at 
flows in networks of a particular type, and learn how to use matrices to 
work with them. 


2.1 Networks 


Figure 3 shows a simple example of a network of water pipes. Water can 
be poured into the system at position A or B, and it then flows down the 
pipes to come out at positions U, V and W. The pipes have different 
capacities: the three pipes that come out of position A are labelled with 
numbers that indicate the proportions of water that flow through these 
pipes when water is poured in at A, and the three pipes that come out of 
position B are labelled in the same way. For example, if a litre of water is 
poured in at position A, then 0.4 litres flows down the first pipe from A, 
0.2 litres flows down the second pipe from A, and 0.4 litres flows down the 
third pipe from A. 


Figure 3 A network of water pipes 


A network like this can be represented more simply as a network 
diagram, which is a mathematical representation of a physical network. 
The network diagram in Figure 4 represents the network in Figure 3. 


Figure 4 A network diagram representing the pipe system in Figure 3 


The dots in a network diagram are called nodes, and the lines are called 
arcs. The arrows on the arcs in Figure 4 are included because this network 
diagram represents a network in which flow along an arc takes place in 
only one direction. When no confusion arises, we use the word ‘network’ to 
mean either ‘network diagram’ or ‘physical network’. We’ll use network 
diagrams to represent networks from now on in this section. 
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Example 2 Calculating network outputs 


The network diagram in Figure 4 is repeated below. 


How much water is output at each of nodes U, V and W if the only 
input is 1 litre of water at node B? 


Solution 


®. The label on the arc joining node B to node U indicates the 
proportion of the water input at node B that reaches node U. ® 


The label on the arc from node B to node U is 0.3. So 0.3 litres of 
water is output at node U when 1 litre is input at node B. 


@. Read the labels on the other arcs from node B. & 


Similarly, 0.1 litres of water is output at node V, and 0.6 litres of 
water is output at node W. 


Now consider what happens if water is input at both node A and node B 
in the network in Example 2. The amount of water reaching an output 
node is the sum of the water reaching the node from node A and from 
node B. For example, if 1 litre of water is input at node A and another 

1 litre is input at node B, then node U will collect 0.4 litres of water from 
node A and 0.3 litres from node B. So, in total, 0.7 litres of water will be 
output at node U. Similarly, 0.2 + 0.1 = 0.3 litres of water will be output 
at node V, and 0.4+ 0.6 = 1 litre of water will be output at node W. 


In general, if x litres of water is input at node A and y litres of water is 
input at node B, then 


0.42 + 0.3y litres of water is output at node U, 
0.22 + 0.1y litres of water is output at node V, 
0.4x + 0.6y litres of water is output at node W. 


In the next activity you can practise calculating network outputs yourself. 
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Activity 14 Calculating network outputs 


The following network diagram represents a network of water pipes. 


U V W 


(a) How much water is output at nodes C and D if 1 litre of water is 
input at node U, 2 litres of water is input at node V, and there is no 
input at node W? 


(b) How much water is output at nodes C and D if x litres of water are 
input at node U, y litres of water is input at node V, and z litres of 
water is input at node W ? 


Network diagrams like those that you’ve seen in this subsection can be 
used to model many other kinds of flows, such as the movement of money 
in and out of companies. For example, consider three corporate donors, D, 
E and F, say, that every month make donations to two charities, J and K. 
Donor D donates 20% of its monthly designated sum to charity J and 80% 
to charity K, donor E donates 30% of its sum to charity J and 70% to 
charity K, and donor F donates 50% of its sum to charity J and 50% to 
charity K. 


The flow of money from the three donors to the two charities can be 
represented by the network in Figure 5. For example, a total donation of 
£1000 from donor D in a given month is modelled by an input of £1000 at 
node D. The label of the arc connecting D to J tells you that 0.2, or 20%, 
of that sum, which is £200, reaches the output node J. 


D E F 


Figure 5 A network diagram representing the flow of money from three 
corporate donors to two charities 
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Activity 15 Calculating more network outputs 


The network diagram in Figure 5 is repeated below. 


D E F 


It represents the flow of money from donors D, E and F to charities J 
and K, as described above. How much money is output at each of nodes J 
and K in each of the following situations? 


(a) The only input is £2000 at node D. 


(b) The input at node D is £2400, the input at node F is £1700, and the 
input at node F is £1100. 


(c) The input (in £) at node D is x, the input (in £) at node E is y, and 
the input (in £) at node F is z. 


2.2 From networks to matrices 


In this subsection you’ll see how network output calculations of the kinds 
that you saw in Subsection 2.1 can be expressed succinctly using matrices. 


Consider again the first network representing water pipes that you saw in 
Subsection 2.1. It is repeated in Figure 6. 


Figure 6 A network representing water pipes 


The amounts of water (in litres) input at nodes A and B can be 
represented as a two-dimensional vector (a 2 x 1 matrix), whose first and 
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second components are the inputs at A and B, respectively. For example, 
the vector 


0 
1 
represents 0 litres of water input at node A and 1 litre of water input at 


node B. 


The outputs from the network can also be represented by a vector. For 
example, the vector 


0.4 
0.2 
0.4 


represents an output of 0.4 litres of water at node U, 0.2 litres at node V, 
and 0.4 litres at node W. 


The network itself can be represented by the matrix 


0.4 0.3 
0.2 0.1 
0.4 0.6 


Here the elements in each column are the labels of the arcs from a node 
(A or B), and the elements in each row are the labels of the arcs to a node 
(U, V or W), as shown below: 


A B 
U /04 0.3 
V {02 0.1 
W \0.4 0.6 


For example, the element in the column labelled B and the row labelled U 
is the label of the arc from node B to node U. You'll see the reason for 
arranging the arc labels in this way in a moment. 


You saw in Subsection 2.1 that if x litres of water is input at node A and 
y litres of water is input at node B, then 


0.42 + 0.3y litres of water is output at node U, 
0.27 + 0.1y litres of water is output at node V, 


0.4x + 0.6y litres of water is output at node W. 


In other words, 


r 0.4z + 0.3y 
if the input vector is ( ) , then the output vector is | 0.2x + 0.ly 
Y 0.42 + 0.6y 


This output vector can be expressed as the product of the matrix that 
represents the network and the input vector: 


0.42 + 0.3y 0.4 0.3\ / 
0.22+01y)— [02 0.1 ( ) 
0.4 + 0.6y 0.4 0.6) Y 
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t think those water 
engineers may have used 
the wrong matrices. 


So we can say that 


0.4 0.3 
if the input vector is (5) , then the output vector is | 0.2 0.1 (*) ; 
4 0.4 0.6) Y 


So, for any input vector, you can work out the corresponding output vector 
by multiplying the input vector by the matrix that represents the network. 
This is illustrated in the next example. 


Example 3 Using matrices to calculate network outputs 


Use the matrix that represents the network diagram in Figure 6 to 
calculate the outputs at nodes U, V and W if 2 litres of water is 
input at node A and there is no input at node B. 


Solution 


@. The matrix representing the network was found before this 
example. ® 


The matrix representing the network is 


0.4 0.3 
O2 OIL 
0.4 0.6 


The input vector (in litres) is 


2 
0): 
@. The output vector is the product of the matrix representing the 


network and the input vector. ® 


Hence the output vector is 


OA Os 9 0.4 x 2+0.3 x 0 0.8 
02 0l (a) = |02 x 24-01% 0] = || O4 
0.4 0.6 0.4 x 2+0.6 x 0 0.8 


So the outputs at nodes U, V and W are 0.8 litres, 0.4 litres and 
0.8 litres, respectively. 


In general, to represent a network by a matrix in the way described above, 
you need a column for each input node and a row for each output node. So 
if the network has n input nodes and m output nodes, then the matrix has 
size M X n. 


When you find a matrix representing a network of the type considered in 
this section, it’s useful to check that the elements in each column have 
sum 1. The matrix must have this property because the elements in each 


column are the labels on an arc from a particular node. You can see that 
the matrix representing the network in Example 3 has this property. 


Activity 16 Using matrices to calculate network outputs 


The network in Activity 14, which represents a network of water pipes, is 
shown again below. 


(a) Write down the matrix that represents this network. 


(b) Write down a vector that represents an input of 1 litre of water at each 
of nodes U, V and W. 


(c) Use your answer to part (a) to calculate the output vector for the 
input vector in part (b), and interpret your answer in terms of the 
amounts of water output at nodes C and D. 


2.3 Combining networks 


Networks can be combined by making the outputs from one network 
become the inputs to another network. In this section you’ll see how to use 
matrices to calculate the outputs from a combined network using the 
information about the original networks. 


The network diagram in Figure 4 and the one in Activity 14 are repeated 
in Figure 7. Both diagrams represent networks of water pipes. 


Figure 7 Two network diagrams 
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The first network has three output nodes, and the second network has 
three input nodes. We can join these nodes so that, for instance, the water 
output at U in the first network flows down the pipes that leave U in the 
second network. The combined network is shown in Figure 8. 


Figure 8 The combined network 


The network obtained in this way can be considered as equivalent to a 
simpler network with arcs going directly from the input nodes A and B to 
the output nodes C and D, as illustrated in Figure 9. In general, we 
regard two networks of the type considered in this section as equivalent if 
they have the same input and output nodes, and all choices of inputs 
produce the same outputs in both networks. 


A B 


C D 


Figure 9 A network equivalent to the one in Figure 8 


How can we find the labels for the arcs in the new, simpler network 
diagram? Here’s how you can find the label for the arc from A to C, for 
example. 

Suppose that 1 litre of water is input at node A in the combined network 


diagram, and there are no other inputs. Then, from the arc labels in the 
top half of the diagram, you know that the amounts of water passing 


through nodes U, V and W are 0.4, 0.2 and 0.4 litres, respectively. So, 
from the arc labels in the bottom half of the diagram, you can work out 
the following. 


e Of the 0.4 litres of water that passes through node U, the amount that 
reaches node C is 0.2 x 0.4 = 0.08 litres. 


e Of the 0.2 litres of water that passes through node V, the amount that 
reaches node C is 0.6 x 0.2 = 0.12 litres. 


e Of the 0.4 litres of water that passes through node W, the amount 
that reaches node C is 0.3 x 0.4 = 0.12 litres. 


To find the total amount of water output at node C, we add the 
contributions from the three different routes, which gives 

0.08 + 0.12 + 0.12 = 0.32 litres. So the proportion of the water input at 
node A that’s output at node C is 0.32, and hence the arc from A to C 
should be labelled with the number 0.32. 


You can find the labels for the remaining arcs in a similar way. However, 
these calculations are quite tedious, and it turns out that they can be 
simplified by using the matrices that represent the two original networks, 
which are 


ame 0.2 0.6 0.3 
0.2 0.1 and 08 04 0.7)? 
0.4 0.6 ` ` f 


respectively. These matrices were found in Subsection 2.2. 


Suppose that the input and output vectors (in litres) for the combined 
network in Figure 8 are 


(3) ot (): 


respectively. Suppose also that the output vector (in litres) of the top part 
of the network, which is also the input vector for the bottom part, is 


U 
U 
w 
Then 
a 04 as (:) m = 0.6 o : 
v a a) T \o8 04 0.7) |? 


Using the first of these equations to substitute in the second gives 


i 0.6 2 y K (a) 
d 0.8 0.4 0.7 JA 06 b 


You know from Section 1 that matrix multiplication is associative, so 


Pa 0.6 a ae GE 
d 0.8 04 0.7) \ a4 og b 
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that is, 


E (2) 


The conclusion of this discussion is that the matrix that represents the 
combined network in Figure 9 is the product of the matrices that represent 
the original networks. So matrix multiplication provides a neat way to 
calculate outputs of the combined network. 


Notice that the matrix representing the top network is the second matrix 
in the product that represents the combined network. 


The labels of the simpler network that is equivalent to the combined 
network can be read off the product matrix. This is easier if we label the 
rows and the columns with the nodes as below: 


A B 
C (0.32 0.3 
D \0.68 0.7) ° 
Notice that each column of this matrix has sum 1, as expected in view of 


the comment after Example 3. 


The original combined network and the equivalent simpler network, with 
the arc labels, are shown in Figure 10. 


Figure 10 The two equivalent networks 


Activity 17 Finding the matrix representing a combined network 


The network below is the combination of two smaller networks, one with 
input nodes U and V and output nodes P and Q, and the other with input 
nodes P and Q and output nodes A, B, and C. 


(a) 


U V 


Write down the matrix equation that gives the output vector t ) at 
nodes P and Q when the input vector is (*) at nodes U and V. 


a 
Write down the matrix equation that gives the output vector | b | at 
c 


nodes A, B and C when the input vector is r ) at nodes P and Q. 


Use your answers to parts (a) and (b) to find the matrix equation for 
the combined network when the input vector is j at nodes U 
and V. 


Draw a network with input nodes U and V and output nodes A, B 
and C that is equivalent to the given network. 


Suppose that the given network represents a network of water pipes. 
Determine the outputs at nodes A, B and C when 1 litre of water is 
input at U and 2 litres of water is input at V. 


Joining networks is also useful in other contexts. For example, Figure 11 
repeats the network diagram from Figure 5, which represents the flow of 
money from three corporate donors D, E and F to two charities J and K. 
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Figure 11 A network diagram representing the flow of money from three 
corporate donors to two charities 


Suppose that donors D, E and F are the only donors to the two charities, 
and that the two charities spend their money in two different geographical 
regions, R and S, as indicated in Figure 12. So, for example, charity J 
spends 60% of its money in region R and 40% in region S. 


J K 


0.6 0.1 


R S 


Figure 12 A network diagram representing the flow of money from two 
charities into two regions 


In the next activity you’re asked to combine the networks in Figures 11 
and 12 to obtain a network that represents the flow of money from the 
three corporate donors to the two regions. 


Activity 18 Finding the matrix representing another combined 
network 


(a) Write down the matrices that represent the networks in Figures 11 
and 12. 


(b) Hence find the matrix that represents the network obtained by 
combining these two networks. 


(c) Draw a simplified network that represents the combined network 
described in part (b). 


(d) If, in a particular month, donor D donates £1700, donor E donates 
£1400 and donor F donates nothing, how much money is spent in 
region S? 


3 The inverse of a matrix 


In a real-life situation, networks like the one in Activity 18 would have 
many more input and output nodes, and the matrices representing them 
would be manipulated on a computer. 


3 The inverse of a matrix 


In Section 1 you met four matrix operations: addition, multiplication by a 
scalar, subtraction and matrix multiplication. No division was mentioned: 
it is not possible to divide a matrix by another matrix. However there is a 
way to manipulate matrices that in certain situations plays the role of 
division. To understand this method, you need to know about special 
matrices called identity matrices, which are introduced next. 


3.1 Identity matrices 


The number 1 has a special property among real numbers: multiplication 
by 1 leaves any real number unchanged. An identity matrix is a matrix 
that behaves like the number 1, in the sense that if a matrix A is 
multiplied by an identity matrix of an appropriate size then the result is 
again A. You will meet an identity matrix in the next activity. 


Activity 19 Multiplying by an identity matrix 


3 4 2 7 -3 1 0 
Let A = _9 tee 1 >) and t= (1 1). Evaluate each of 


the following products, where possible: 


AI, IA, BI, IB. 


The matrix I in Activity 19 has the following two properties. 

e If A is any matrix such that the product AI exists, then AI = A. 
e If A is any matrix such that the product IA exists, then IA = A. 
Any matrix I with these two properties is called an identity matrix. 


For a matrix to be an identity matrix, it has to be square. To see this, 
suppose that A is an m x n matrix. Then in order for the product AI to 
be defined, I must have n rows. Moreover, in order for AI to have size 
m x n (which it must if it is to be equal to A), I must have n columns: 


mx|n | times [n]xn gives m xn. 


So I must have size n x n. 
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It’s straightforward to verify that the following 2 x 2, 3 x 3 and 4 x 4 
matrices are identity matrices: 


1 0 0 0 
1 0 Te 0 1 0 0 
017’ \o o 1?) |0010 
0 0 0 1 


In fact, for every natural number n, the n x n matrix that has ones down 
the leading diagonal (the diagonal that starts at the top left element and 
ends at the bottom right element), and zeros everywhere else, is an 
identity matrix. 


Furthermore, for every natural number n, this matrix is the only n x n 
identity matrix. To see this, let I be the n x n identity matrix described 
above, and let E be an n x n identity matrix. Then IE = I and IE= E, 
so I= E. 


When we’re working with matrices, we usually reserve the letter I to 
denote the n x n identity matrix. 


Activity 20 Multiplying by an identity matrix 


x 


f : and B = | y | be general matrices, and let I be the 
z 


3 x 3 identity matrix. Verify that AI = A and IB = B. 


Let A = ( 


The facts that you’ve seen about identity matrices are summarised in the 
following box. 


Identity matrices 

An identity matrix is a square matrix I such that 

e for any matrix A for which the product AI is defined, AI = A 
e for any matrix A for which the product IA is defined, IA = A. 


Each identity matrix has the form 


l O O see 0 
O L O s 0 
~ O l ss O 
© O O ses i 


That is, it has ones down the leading diagonal and zeros elsewhere. 


3 The inverse of a matrix 


3.2 Matrix inverses 


As mentioned earlier, it isn’t possible to divide matrices, but it is 
sometimes possible to perform a manipulation with a similar effect. 


Remember that, in the arithmetic of numbers, dividing by a number is the 
same as multiplying by its reciprocal. For example, dividing by 2 is the 
same as multiplying by 5. The reciprocal of a non-zero number a is the 
number b such that ab = 1. 


There is a notion in matrix arithmetic that is analogous to the notion of a 
reciprocal in ordinary arithmetic. Let A be a square matrix. If there is 
another matrix B of the same size with the property that 


AB=I and BA =I, 


where I is an identity matrix, then we say that A is invertible and that B 
is an inverse of A. Multiplying by an inverse of a matrix is useful in many 
situations where you might want to use a matrix operation analogous to 


division. 
Activity 21 Checking matrix inverses t thought you told us 
to practise lnverting 
For each of the following pairs of matrices A and B, check that B is an mattresses! 


inverse of A, by finding the products AB and BA. 


2 3 1 0 1 =i 
b) A={-1 10], B={0 2 -1 
—2 1 0 1 -8 5 


If a matrix A is invertible, then it has exactly one inverse. To see this, 
suppose that the matrices B and C are both inverses of the matrix A. 
Then 


AB=I and BA =I, andalo AC =I and CA=lI. 

Now consider the matrix BAC. Since matrix multiplication is associative, 
BAC = (BA)C = IC =C, 

and also 
BAC = B(AC) = BI =B. 

Hence B and C are the same matrix. That is, A has only one inverse. 


We usually write the inverse of an invertible matrix A as A~!. So the 
following facts hold. 
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Inverse of a matrix 
If A is an invertible matrix, then 
AA =I and A™A =I, 


where I is the identity matrix of the same size as A. 


It follows from the definition of the inverse of a matrix that if A is an 
invertible matrix, then not only is A~! the inverse of A, but also A is the 
inverse of A~!. In other words, A and A~! are inverses of each other. 


In the rest of this subsection, you'll learn how to establish whether any 
particular 2 x 2 matrix has an inverse, and how to work out the inverse 
when it exists. You’ll also use a computer to calculate inverses of larger 
matrices. 


Inverses of 2 x 2 matrices 


For 2 x 2 matrices, there is a useful formula for finding inverses. 


Inverse of a 2 x 2 matrix 


The inverse of the 2 x 2 matrix te 3) is given by 


il = 
E. e , , provided that ad — bc Æ 0. 
In other words, to obtain the inverse when ad — bc 4 0, swap the two 
elements on the leading diagonal and multiply the other two elements 
by —1, then multiply the resulting matrix by the scalar 1/(ad — bc). 


You'll be asked to verify this general formula later in this subsection, but 
for now let’s check that it works for the matrix 


=G i) 


in Activity 21(a). Here ad — be = 8 x 2 — 5 x 3 = 1 Æ 0, so the scalar 
1/(ad — bc) is 1. You can see that the matrix 


ET 


in Activity 21 (a) is indeed obtained from A by swapping 8 and 2, 
multiplying the other two elements by —1, and multiplying the resulting 
matrix by the scalar 1. 


Notice that the box above includes the condition ad — be Æ 0. This is 
because the formula involves dividing by ad — bc, and division by 0 is not 
defined. If the elements of a 2 x 2 matrix do not satisfy this condition, 


then the matrix does not have an inverse. You'll see a proof of this fact in 
the next section. 


In fact the quantity ad — bc that appears in the formula is so important 
that it deserves a name, as follows. 


Determinant of a 2 x 2 matrix 


Let A = p 


determinant of A, and written as det A. 


. Then the number ad — bc is called the 


If the determinant of A is not zero, then A is invertible. 
If the determinant of A is zero, then A is not invertible. 


Some texts denote det A by |A|, which looks like the notation for the 

modulus of a real number, but has a different meaning in this context. 

This notation is also used when a matrix is written out in full; for example, 
| 2 1 


3 0 =2 x 0—1 x (-3). 


When seeking the inverse of a 2 x 2 matrix, you should first check that the 
matrix is invertible by calculating the determinant. If the determinant is 
not zero, then you can use the rule for finding the inverse. 


3 The inverse of a matrix 


255 


Unit 9 Matrices 


256 


You can practise finding the inverses of 2 x 2 matrices in the next activity. 


Activity 22 Finding inverses of 2 x 2 matrices 


Determine which of the following matrices are invertible. For each matrix 
that is invertible, write down its inverse. 


(a) A= E ) (b) B= (5 s) 


oc-(7 i) wn=(3 33) 


In the next activity you’re asked to verify that the formula for the inverse 
of a 2 x 2 matrix works in general. 


wl o= 


4 Simultaneous linear equations and matrices 


Activity 23 Verifying the formula for the inverse of a 2 x 2 matrix 
a b : 1 d —b 
Let A= (° J with ad~be# 0, and B=- ( 4 J 
Work out the products AB and BA, and deduce that B = A7!. 


Hint: use the fact that P(kQ) = k(PQ) for any matrices P and Q for 
which the product PQ is defined and for any scalar k. 


A matrix that does not have an inverse is called a non-invertible matrix. 
In some texts, non-invertible matrices are called singular matrices, and 
invertible matrices are called non-singular matrices. 


Determinants and inverses of larger matrices 


You have seen that a 2 x 2 matrix has an inverse only if its determinant is 
not zero. 


The determinant is defined for square matrices of any size, and it plays the 
same role as in the 2 x 2 case: any square matrix A such that det A 4 0 
has an inverse A~!. If det A = 0, then no inverse can be found. 


It is possible to use hand calculations to work out the determinant and the 
inverse (when it exists) of a square matrix whose size is larger than 2 x 2, 
but the process is rather complicated, and beyond the scope of this 
module. However, you can use the computer algebra system to calculate 
determinants and inverses of larger matrices. 


Activity 24 Working with matrix inverses on a computer 


Work through Subsection 10.2 of the Computer algebra guide. 


4 Simultaneous linear equations and 
matrices 


In Section 3 of Unit 2, you met simultaneous linear equations and you saw 
two methods (substitution and elimination) for solving them. Here you'll 
meet a third method, which uses matrices. 
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4.1 Solving simultaneous linear equations in two 
unknowns 


Consider the following simultaneous linear equations in the two 
unknowns x and y: 


0.4% + 0.2y = 4 


3 
0.62 + 0.8y = 11. (3) 


There’s a useful way to write equations (3) as a single equation involving 
two-dimensional vectors (which are 2 x 1 matrices). To do this, you write 
the sides of each equation as the components of a vector, as follows: 


0.44+0.2y\ (4 
0.67 +0.8y)/ U1)’ 


You can now write the left-hand side of this equation as the product of a 
2 x 2 matrix and a two-dimensional vector: 


(08 oa) G) = (i). 


The 2 x 2 matrix that appears in this equation is called the coefficient 
matrix of the simultaneous linear equations (3). Its elements are the 
numbers that appear on the left-hand sides of equations (3). Any pair of 
simultaneous linear equations in two unknowns can be written in this 
form. 


Matrix form of simultaneous linear equations 
The simultaneous linear equations 

UY Se 0] = e 

cx + dy = f 


can be written as the matrix equation 


ea- 


The matrix ¢ i) is called the coefficient matrix. 


d 


Note that the coefficient matrix of simultaneous linear equations depends 
on how the equations are written. For example, if you write equations (3) 
as 


0.62 + 0.8y = 11 
0.42 + 0.2y = 4, 


then you obtain the coefficient matrix 
0.6 0.8 
0.4 0.2)’ 


which is different from the coefficient matrix obtained above for the same 
equations. 


4 Simultaneous linear equations and matrices 


There are other possible solutions to Example 5. For example, the 
equations can be rewritten in the form 


—27+ y=-1 
Ox + 3y = 2, 
which gives the matrix form 
—2 1\/a\_ /-l 
0 3) WwW) ~ 2) 
In the next activity you can practise finding matrix forms of pairs of 
simultaneous linear equations. 


Activity 25 Writing simultaneous linear equations in matrix form 


Write each of the following pairs of equations in matrix form. 


a) 2r + 3y =3 b) 22-6 =0 c) 3a—4y = 18 
5 5 
x + 4y = -1 3x + 6y = 15 ir+ły=-1 
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The matrix form of a pair of simultaneous linear equations is not just 
convenient shorthand: it is the initial step in a method for solving the 
equations. Let’s see how this method works for equations (3) from the 
beginning of this subsection, which have the matrix form 


(03 os) G) = (a): 


Let A be the coefficient matrix; that is, 
0.4 0.2 
A= o a f 


Then the matrix form of the equations can be written as 


a(t) = Ga) 


Since det A = 0.4 x 0.8 — 0.2 x 0.6 = 0.2, the matrix A has an inverse A~!. 
If we multiply both sides of the equation above by A~!, then the result is 


a)-e) 


By the associativity of matrix multiplication, this equation can be 
rearranged as: 


win (a8) 


Since A~'A =I, where I is the 2 x 2 identity matrix, the equation above 
can be rewritten as 


(C) 


Since I is an identity matrix, our equation becomes 


o ° 


Therefore, if we work out A~! and calculate the product on the right-hand 
side of this matrix equation, then we’ll find a solution to the original 
equations. Now 


aa f 08 -02\_( 4 -1 
0.2 \—0.6 0.4 -3 2)}° 
So substituting for A~! in equation (4) gives 
a\ (4 -1\(4\_ (5 
y) 7 \-3 OP Naty 7 Uoj’ 


In other words, 


x=5 and y= 10. 


4 Simultaneous linear equations and matrices 


The example above illustrates a general method for solving simultaneous 
linear equations, which can be summarised as follows. 


Strategy: 
To solve a pair of simultaneous linear equations in two 
unknowns using matrices 


Write the simultaneous linear equations in matrix form Ax = b, 
where A is the coefficient matrix, x is the corresponding vector of 
unknowns, and b is the vector whose components are the 
corresponding right-hand sides of the equations. 


If the matrix A is invertible, then the solution is given by 


x=A'b. 


© 
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As usual, you can check the answer to Example 6 by substituting back into 
the original equations. 


You can practise the method in Example 6 in the next activity. 


Activity 26 Using matrices to solve simultaneous linear equations 


Use matrices to solve the following pairs of simultaneous linear equations. 


(You were asked to write these pairs of equations in matrix form in 
Activity 25.) 


a) 20+ 3y =3 b) 22-6 =0 c) 3x — fy = 18 
5 5 
x + 4y = -1 3x + 6y = 15 isp 8y=-1 


4.2 Non-invertible coefficient matrices 


You saw in Unit 2 that a pair of simultaneous linear equations in two 
unknowns can have no solutions, exactly one solution or infinitely many 
solutions. If it has exactly one solution, then we say that it has a unique 
solution. 


In this subsection you'll see that a simple way to tell whether a pair of 
simultaneous linear equations in two unknowns has a unique solution is to 
work out the determinant of its coefficient matrix. 


If the determinant is non-zero, then the coefficient matrix has an inverse 
given by the formula that you saw in Subsection 3.2, and hence the 
equations have a unique solution, by what you saw in Subsection 4.1. 


Let’s now look at what happens when the determinant is zero. Let’s start 
by looking at two examples. 


4 Simultaneous linear equations and matrices 


First, consider the following pair of simultaneous equations: 
2x — 3y = 5 
—4x + 6y = 7. 


The coefficient matrix is 


(i 9): 


which has determinant 2 x 6 — (—3) x (—4) = 0. Let’s look for a solution 
to the equations using the elimination method. If we multiply both sides of 
the first one of equations (5) by —2, then we get 

—4x + 6y = —10 

—4¢+6y= 7. 


These equations have identical left-hand sides, but different right-hand 
sides. So they have no solution: no values of x and y can make —4z + 6y 
equal to both —10 and 7. 


Now consider the following pair of simultaneous equations: 
x+2y=—6 
—3a — 6y = 18. 


The coefficient matrix is 


(3-4); 


which has determinant 1 x (—6) — 2 x (—3) = 0. Again, let’s look for a 
solution to the equations using the elimination method. If we multiply 
both sides of the first of equations (6) by —3, then we get 


—3x — y = 18 
—3x — 6y = 18. 


These equations are identical. So they have infinitely many solutions: all 
values of x and y that satisfy the first equation also satisfy the second 
equation. 


You’ve now seen two cases where the coefficient matrix has determinant 
zero. In the first case, the simultaneous equations had no solution. In the 
second case, they had infinitely many solutions. In fact, whenever the 
coefficient matrix of a pair of simultaneous linear equations has 
determinant zero, one of these two possibilities must occur. 


To see why, consider any two simultaneous linear equations 

ax + by =e 

ca + dy = f, 
and suppose that the determinant of the coefficient matrix is zero; that is, 
ad — bc = 0. We can assume that in each equation at least one of the 
coefficients is non-zero; otherwise, the equation is of the form Ox + Oy = 


constant, so the pair of equations has either no solutions or infinitely many 
solutions. 
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Suppose that in the first equation a Æ 0; a similar argument holds if b Æ 0. 
Since ad — bc = 0, we have d = bc/a. Hence c Æ 0, since if c = 0 then d = 0 
also. Multiplying the first equation by c/a gives 


cx + (bc/a)y = ce/a 
cz + dy = f. 
Since d = bc/a, these two equations can be rewritten as 
cx + dy = ce/a 
ca + dy = f. 
These equations have identical left-hand sides. So they (and hence the 


original equations) have either no solution (if ce/a # f) or infinitely many 
solutions (if ce/a = f). 


So, in conclusion, if the coefficient matrix has determinant zero, then the 
equations have either no solution or infinitely many solutions. 


We can deduce from this fact that if a 2 x 2 matrix has determinant zero, 
then it is not invertible. This is a fact that was mentioned in 

Subsection 3.2. To see how, consider any 2 x 2 matrix with determinant 
zero. If the matrix were invertible, then any pair of simultaneous equations 
with this matrix as its coefficient matrix would have a unique solution, 
which cannot be the case, since you saw above that if the determinant of 
the coefficient matrix is zero, then the equations have no solution or 
infinitely many solutions. 


So you have learned the following important payoffs from the matrix 
method for simultaneous linear equations. 


For a pair of simultaneous linear equations in two unknowns: 


e if the determinant of the coefficient matrix is non-zero, then this 
matrix is invertible and the equations have a unique solution 

e if the determinant of the coefficient matrix is zero, then this 
matrix is not invertible and the equations have no solution or 
infinitely many solutions. 


Thus, calculating the determinant of the coefficient matrix gives a quick 
test for the existence of a unique solution. 


Activity 27 Determining whether simultaneous equations have 
unique solutions 


Which of the following pairs of simultaneous linear equations has a unique 
solution? (You are not asked to find the solution.) 
(a) 5a—3y=5 (b) 3r- y=-6 

-x + 4y =7 —9x + 3y = 18 


4 Simultaneous linear equations and matrices 


4.3 Simultaneous linear equations in more than two 
unknowns 


The matrix method that you met in Subsection 4.2 can be used to find a 
solution for more than two simultaneous equations in more than two 
unknowns. 


In Subsection 5.3 of Unit 1, you learned that a linear equation is one in 
which, after you’ve expanded any brackets and cleared any fractions, each 
term is either a constant term or a number times a variable. For example, 
the equation 


r+2Qy-—z=3 
is linear, but none of the equations 
gt+y?=2, r+yz=1 and r+% =1 


are linear. In this subsection we’ll usually write a linear equation in a form 
similar to the linear equation above, that is, with its left-hand side as a 
sum of terms, each of which is a number times a variable (and with each 
variable appearing just once), and with the right-hand side as a number. 


We'll refer to a collection of linear equations in a given set of unknowns as 
a system of linear equations. For example, here’s a system of three 
linear equations in three unknowns, x, y and z: 


2x + 2y+3z=2 
2x + 3y + 4z = —1 
4x + Ty + 102z = 3. 


A solution to a system of linear equations is an assignment of values to 
the unknowns that makes all the equations hold simultaneously. As in 
Subsection 3.1 of Unit 2, the process of finding these values is known as 
solving the equations simultaneously. We’ll look only at systems where the 
number of unknowns is equal to the number of equations. 


Numerical problems that can be modelled by systems of linear equations 
often arise in applications. Consider, for example, a bank that offers three 
types of investment to its customers: a financial product A that pays 
guaranteed interest of 2.5% over two years and is quite safe, and two 
financial products B and C that in the same time period are likely to yield 
interests of 5% and 6% respectively, but are more risky. 


Suppose that a customer is seeking to invest £9000 over two years, and is 
advised to spread his investment across the three types of financial 
products. He is risk-averse, so he wants to invest a total of only £4000 
across the two riskier products B and C. Of this £4000, he wants to invest 
just enough in the higher-yielding product C to ensure that his total 
£9000 investment is likely to yield an overall return of 4%, with the 
remainder of the £4000 invested in the lower-yielding product B, to spread 
his risks. How should he spread his investment across the three products in 
order to achieve his desired return? 
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First of all, the amounts invested in each product must add up to £9000. 
So, if x, y and z represent the amounts invested (in £) in products A, B 
and C, respectively, then 


£+ y+ z = 9000. 


Moreover, the investor wants to invest a total of £4000 in products B 
and C, so 


y + z = 4000. 


Since the interest on his investment in product A is 2.5%, the investor’s 
expected interest payment from his investment in product A is 0.025x. 
Likewise, his expected interest payments from his investments in products 
B and C are 0.05y and 0.06z, respectively. The investor is seeking an 
overall return of 4%, that is, an overall interest payment of 

0.04 x £9000 = £360. Thus the amounts invested in the three products 
must also satisfy the equation 


0.0252 + 0.05y + 0.062 = 360. 


Therefore, the amounts x, y and z must simultaneously satisfy the three 
linear equations 


z+ y+  2=9000 
y+ z= 4000 (7) 
0.0252 + 0.05y + 0.062 = 360. 


Such systems of linear equations can be solved efficiently by methods 
similar to elimination and substitution — you can probably see that 
equations (7) can be solved fairly easily for x, y and z by 


e first subtracting the second equation from the first to give x = 5000 


e then substituting this value for x into the third equation and solving 
the resulting two simultaneous equations for y and z. 


Methods of this sort are covered in detail in other modules. Systems of 
linear equations like those above can also be solved by using matrices, and 
the rest of this subsection covers this approach. 


Any system of n linear equations in n unknowns can be written as a single 
matrix equation 


Ax = b, 


where A is an n x n coefficient matrix, x is an n-dimensional vector whose 
components are the unknowns, and b is an n-dimensional vector, as you 
saw for the case n = 2 in Subsection 4.2. If the coefficient matrix A is 
invertible, then the matrix equation can be solved by multiplying both 
sides by the inverse of A, giving x = A~'b, just as in the 2 x 2 case. 


In Section 3 you learned how to use a computer to find the inverses of 
matrices of size 3 x 3 and larger. This enables you to use the matrix 
method to solve systems of three or more linear equations. In the next 
example, this method is used to solve equations (7). 


4 Simultaneous linear equations and matrices 


Example 7 Solving a system of three linear equations 


Use the matrix method to solve the following system of three linear 
equations. 


c+ yt 2z=9000 
y+ z= 4000 
0.0252 + 0.05y + 0.06z = 360 
Solution 


The matrix form of the system of three equations is 


1 io i z 9000 
0 io i y | = | 4000 
0.025 0.05 0.06/ \z 360 


@. Use the computer algebra system to find the inverse of the 
coefficient matrix. ® 
The inverse of the coefficient matrix is 
Il —1 0 
D5 3.5 —100 
—2.5 —2.5 100 


@. Multiply each side of the matrix equation by the inverse of the 
coefficient matrix. ® 


Therefore the solution is given by 


T il =I 0 9000 5000 
y| = 2.5 3.9 —100 4000 | = | 500 
Z =2 =25) OO 360 3900 


Note that you could use the computer algebra system to solve the 
simultaneous equations (7) directly, that is, without first finding the 
inverse of the coefficient matrix, and so check the answer to Example 7. 


The solution to equations (7) found in this example tells our investor that 
the best way to spread his investment in order to achieve a 4% return is as 
follows: 


£5000 on investment A 


£500 on investment B 
£3500 on investment C. 


You can practise finding inverses of matrices and using them to solve 
systems of more than two linear equations in the next activity. 
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ne Activity 28 Solving systems of three linear equations 


Solve each of the following systems of simultaneous linear equations by 
using the computer algebra system to find the inverse of the coefficient 


matrix. 

(a) 2x + 2y +3z=2 (b) 3r +2y+z=1 
2x + 3y + 4z = —1 4r + 3y +z =2 
4r + Ty + 10z =3 Tz +5y+z=1 


In fact using matrix inverses is not usually the most efficient way to solve 
systems of linear equations, but the method has theoretical importance. 


For example, it can be shown that, for all n > 2, 

e ifthe determinant of the coefficient matrix of a system of n equations 
is non-zero, then this matrix is invertible and the equations have a 
unique solution 

e ifthe determinant of the coefficient matrix of a system of n equations 


is zero, then this matrix is not invertible and the equations have no 
solution or infinitely many solutions. 


You saw the case n = 2 of this result in Subsection 4.2. 
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Learning outcomes 


The following activity provides further practice on the significance of the 
determinant of the coefficient matrix for systems of three or more linear 
equations. 


Activity 29 Determining whether systems of linear equations have 
unique solutions 


Which of the following systems of linear equations have a unique solution? 


(a) c+2y+3z=5 (b) 2a + 2y+3z=5 
2xr+3y+ z=6 20+ 3y+ z2=6 
4x +T7y+7z=1 4x + Ty+7z=1 


(c) «+ y- wt+z=0 
t= ytwt+2z=2 
2x +3w-—z=1 
6x + 2y + 4w =0 


Learning outcomes 


After studying this unit, you should be able to: 

e use matrix notation 

e evaluate sums, differences and scalar multiples of matrices 
e establish whether two matrices can be multiplied together 


e evaluate a matrix product both by hand and by using the computer 
algebra system 


e determine whether a given n x n matrix has an inverse 


e work out the determinant and inverse (if it exists) of a 2 x 2 matrix by 
hand 


e use the computer algebra system to work out the determinant and 
inverse (if it exists) of a square matrix 


e write a system of linear equations in matrix form and solve it using 
matrix methods where possible. 
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Solutions to activities 


Solution to Activity 1 Solution to Activity 4 
We have a33 = 3, a34 = —4, a43 = =T, 7 A -3 12 9 
a14 = —1, a44 = —9, a42 = —5. arD la ilke a 
The matrix A has four rows and four columns, so 


its size is 4 x 4. (ii) 


Solution to Activity 2 


(a) A+ B= C i) n (3 5) _ (° 3 i 
1 


20 
o A+B= (3 6 o/*\o 1 i 
= 1/9 
(30-3 Gi) J= 
-1 7 1 3 
aah 2x 3x 2 3 
o aspe(*)4(3 >) a (F s) 5ra 5) 
3 ki In each of the answers in this part, there is 
= ( T a m ] more than one way to choose the scalar 


multiple. For example, another correct 
solution to part (i) is 


Solution to Activity 3 45 3 18 12 
( 2 3 =025 (3 a! 


Solution to Activity 5 


3 
2 —2 =f 2 6 We have 
1 0 —2 2 - 1 2 0 -1 
( 3 
-+ 
2 


= plo 


0 -r a-c= (i -2 ae 
z o) nT | ae 


ag 5 —5 —5 
Next, 


1 24 
2a = (5 4 o): 


The matrix 2A has size 2 x 3 and B has size 2 x 2; 
hence the sum B + 2A is not defined. 


Finally, 


wat TA 1 24 
“A=1 8 5 8 —4 0 


270 


Solution to Activity 6 


For each pair of matrices in this question, the sizes 


of the matrices are written underneath the matrices, 


and the middle two numbers are boxed in order to 
use the method described in the text. 


2 9 


3x|2] [2|x3 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 


product is 
2 9 TE 32 6 47 
pil ( 40 3 = | —10 21 12 
—6 8 44 —18 34 


9 
(b) (1 2 3){/8 


1x[3 | [3 ]x1 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 


product is 
9 
(1 2 3){8]} = (46) 
7 
2 9 0 10 
(c) © 4 1 
—6 8 —2 7 


3x|2] a 


The middle numbers are different, so these 
matrices cannot be multiplied together. 


9 
(a) {8 


3x1] [1]x3 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 


(1 2 3) 


product is 
9 9 18 27 
8| (G 2 3)= [8 16 24 
7 7 14 21 


Solutions to activities 


ax|3| [3 ]x3 


The middle numbers are the same, so these 
matrices can be multiplied together. Their 
product is 


pierces) 
biG APNG a. g 15 1 4 


Solution to Activity 7 


(a) (i) Here A has size 2 x 2 and B has size 2 x 1. 
For AB, the size check is 


ax| 2] [2 ]x1. 


The middle numbers are the same, so the 
product AB is defined. 


For BA, the size check is 


ax{1| [2 ]x2. 


The middle numbers are different, so the 
product BA is undefined. 


(ii) Here both A and B have size 2 x 2, so the 
products AB and BA are both defined. 


(b) For the matrices in part (a)(i), the product BA 
is not defined. 


G9 


For the matrices in part (a)(i 


“(EHC 
) 


m- ( JCG 


Therefore AB 4 BA. 


Solution to Activity 8 


(a) The size of A is 1 x 2 and the size of B is 2 x 2. 
The size check for the product AB is 


1 x2 | [2 |x 2. 
Thus AB has size 1 x 2. 


The size of C is 2 x 3. The size check for the 
product BC is 


ax|2| [2 |x 3. 


Thus BC has size 2 x 3. 
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We have 
AB=(-1 0) € jn 1), and 
3 
4 


Be 71) (3 
=(5 72 <i); 


(b) From part (a), the size of AB is 1 x 2. Since C 


has size 2 x 3, the size check for (AB)C is 


1x2] [2]x3. 


So the product (AB)C exists and has size 1 x 3. 
From part (a), the size of BC is 2 x 3. Since A 


has size 1 x 2, the size check for A (BC) is 


1x2] Bee 


So the product A(BC) exists and has size 1 x 3. 
Using the answers for the matrices AB and BC 


found in part (a) we obtain 


(AB)C = (-1 1) (3 i i 
=(-4 1 1) 
and 
A(BC) = (-1 0) € E =) 
=(-4 1 1). 


We can see that (AB)C = A(BC). 


Solution to Activity 9 


We have 
1 0 2 0 
m=2(2 1)=( 2): 
SO 
raya 10 4 
A(aB) = (5 a) G F :) 
Also, 


(i a) 


Therefore A(2B) = 2(AB), as required. 
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Solution to Activity 10 
We have 


B+c= (3) + (71) 7 


assac=(3)+()=(33). 


Thus A(B + C) = AB+ AC, as required. 


Solution to Activity 11 


(a) 


The product of two matrices A and B can be 
formed only if the number of columns of A is 
equal to the number of rows of B. Since M has 
two rows and three columns, the number of its 
rows is not equal to the number of its columns. 
Thus the product MM cannot be formed. 


The number of rows of N is equal to the 
number of columns of N, so the product NN is 
defined. We have 


e (2 2) =(6 -f) 


A matrix A must have the same number of 
rows as columns in order for the product AA to 
be defined. So only matrices that have the same 
number of rows as columns can be squared. 
(You might remember that such matrices are 
called square matrices.) 


Solution to Activity 12 


(a) 


2 


GIGA J- 


T. O I 
0 2 0 
0 0 3 


(b) 


From the text preceding this activity, we know 
20 7 
3 
that A? = & i Therefore 


Aĉ = A’ A?’ = (A8)? 
Jan _ (20 Fy (20: 7 
mare) ri 6) Oi i 


_ (547 182 
= \546 183) ° 


Solution to Activity 14 


(a) 


Inputting 1 litre of water at node U gives 
0.2 litres of water output at node C and 
0.8 litres of water output at node D. 


Inputting 2 litres of water at node V gives 
2 x 0.6 litres of water output at node C and 
2 x 0.4 litres of water output at node D. 


So, in total, we have 0.2 + 2 x 0.6 = 1.4 litres of 
water output at node C, and 0.8 + 2 x 0.4 
= 1.6 litres of water output at node D. 


Using a method similar to that of part (a) gives 
0.2% + 0.6y + 0.3z litres of water output at 
node C and 0.8x + 0.4y + 0.7z litres of water 
output at node D. 


Solution to Activity 15 


(a) 


If £2000 is input at node D, then the output at 
node J is 0.2 x £2000 = £400. 


Similarly, the output at node K is 
0.8 x £2000 = £1600. 


If the input at node D is £2400, at node F is 
£1700, and at node F is £1100, then the 
output (in £) at node J is 

0.2 x 2400 + 0.3 x 1700+ 0.5 x 1100 = 1540, 
and at node K is 

0.8 x 2400 + 0.7 x 1700 + 0.5 x 1100 = 3660. 
If the input (in £) at node D is x, at node E 


is y, and at node F is z, then the output (in £) 
at node J is 


0.22 + 0.3y + 0.52, 
and at node K is 
0.82 + 0.7y + 0.52. 


Solutions to activities 


Solution to Activity 16 


(a) 


(b) 


Since there are three input nodes and two 
output nodes, the network can be represented 
by a 2 x 3 matrix. 


The number that labels the arc from the first 
input node (U) to the first output node (C) 
is 0.2, and this goes in the first row and first 
column. The number that labels the arc from 
the second input node (V) to the first output 
node (C) is 0.6, and this goes in the first row 
and second column. 


Proceeding similarly for all the inputs and 
outputs of the network gives the following 
matrix: 


U V W 
C (0.2 0.6 0.3 
D\08 0.4 0.7 )° 
The vector that represents an input of 1 litre of 
water at nodes U, V and W is 


1 
1 
1 


The output vector for the input vector found in 
part (b) is 


0.2 0.6 0.3 i 
0.8 0.4 0.7 1 
 (/02x1+0.6x1+0.3x1\_ /11 
~ \0.8x1+04x14+0.7x1)  \1.9)° 
This vector represents an output of 1.1 litres of 


water at node Č and 1.9 litres of water at 
node D. 
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Solution to Activity 17 (d) Labelling the matrix that represents the 
(a) The matrix that represents the network with combined network gives 
nodes U, V, P and Q is U V 
U v A (03 0.3 
B {0.2 0.14 
P (0.5 0.2 
Q (o 5 0 :) . C \0.5 0.56 


A simplified network equivalent to the original one 
The matrix equation giving the output a for is given below. 


. u\. 
the input @ is 
p\ (0.5 0.2\ fu 
q} \0.5 0.8) Ww)’ 


(b) The matrix that represents the network with 
nodes P, Q, A, B and C is 


P Q 
A /0.3 0.3 
B {03 0.1 
C \0.4 0.6 
a 
The matrix equation giving the output | b | for 
c (e) An input of 1 litre at node U and 2 litres at 
the input e) is node V is represented by the input vector : 
03 03 Using the equation for the combined network 
; 7 0. 3 A i C) found in part (c), the corresponding output is 
c 0.4 0.6 Pa 
a 0.3 0.3 1 0.9 
a b] ={|0.2 0.14 ( ) = | 0.48 
(c) The matrix equation giving the output | b | for c 0.5 0.56 1.62 
Therefore the output is 0.9 litres at node A, 
an input a in the combined network is 0.48 litres at node B and 1.62 litres at node C. 
obtained by using parts (a) and (b) as follows. Solution to Activity 18 
a 0.3 0.3 p (a) The matrices (with row and column headings) 
b = 0 3 0 1 ( ) are 
E 0.4 0.6 q D E F JI K 
0.3 0.3 J (0.2 0.3 0.5 R/0.6 0.9 
sjo orli (a e K ts 0.7 a] am og Ge ia): 
0.5 0.8 v 
0.4 0.6 respectively. 
0.3 0.3 0.5 0.2 ú (b) The matrix that represents the combined 
= 0.3 0.1 0.5 0 T network is 
0.4 0.6 0.6 0.9\ /0.2 0.3 0.5 
0.3 0.3 0.4 0.1 0.8 0.7 0.5 
u 
= [0.2 0.14 (*) _ (0.84 0.81 0.75 
0.5 0.56 ~ (0.16 0.19 0.25)” 
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(c) The simplified network is shown below. 


R S 


(d) If the input vector (in £) to the combined 
network is 
1700 
1400 
0 


then the output vector (in £) is 


0.84 0.81 0.75 Lay _ (2562 
: : : 0 ~— \ 588] ° 


? 


0.16 0.19 0.25 icy 
So £538 is spent in region S. 


Solution to Activity 19 


The products AI, IA and IB are defined, and they 
give the following results: 


a2) )-(4 Da 
aG GE -G Des 
m4 YEI J-C- 


The product BI is not defined ne B has 
3 columns and I has 2 rows. 


Solution to Activity 20 
We have 


Solutions to activities 


Therefore AI = A and IB = I, as required. 


Solution to Activity 21 


w= (3 9-69) 


2 3 1\ /0 1 ~ 

b) CD=[-1 1 0] ]0 2 -1 

—2 1 O/ U -8 5 
100 
={0 1 0l, 
001 

0 1 =o 2 3i 

DC=ļ|0 2 -1]{-1 1 0 

1 -8 5/\-2 10 
10 0 
={0 1 0 
001 


Solution to Activity 22 


(a) We have det A =13x2-—5x5=1#0. 
Thus A is invertible and 


1_f{ 2-5 
=(5 za 


(b) We have det B = 3 x 2 — 0 x (— 
Thus B is invertible and 


B= 5 (5 y-G :) 
= =\5 aj: 
6\5 3 53 


(Either of these two forms of the inverse is 
acceptable as your final answer.) 

(c) We have det C = —3 x (—4)- 3 x5 = 0. 
Thus C is not invertible. 


(d) We have det D = 1.5 x 1.5 — 2.5 x 0.5 = 1 £ 0. 
Thus D is invertible and 


fie -25 
S m a: 


5)=6 #0. 
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Solution to Activity 23 


Using the property of matrix products given in the 
hint, we have 


sae © 1 d —b 
~ \e d) \ad—bce\-c a 
> 1 a b d —b 
~ ad—bc\c dj \—c a 


(=r 
E 


o 1 ad — be 0 
~ ad — be 0 ad — bc 


Hence AB = BA =I, and it follows that B = A7}. 


Solution to Activity 25 
(a) The coefficient matrix of the equations 
22+ 3y=3 
x + 4y = —1 
n [2 3 : 
is k Thus, the matrix form of the 
equations is 
2 3\/z\_ 3 
1 4) \y) \-1/)° 


(b) First rearrange the first equation as 


2x = 6. 

The matrix form of the equations 
22 =6 
3x + 6y = 15 


is 
2 0\ /x\ (6 
3 6) Wy] 7 u5)’ 
(c) The matrix form of the equations 
3x = $y =18 
set 5y=—l 
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HG@-C). 


Solution to Activity 26 


(a) 


— 
Z 


Activity 25(a) gives the matrix form of the 
equations as 


(a) G)=G) 

1 4) \y —1)° 

The determinant of the coefficient matrix is 
2x4-—3x1=5, 

so the inverse of the coefficient matrix is 
a 4 = 3 E 
sla Tl 3 


Hence 


The solution is z = 3, y = —1. 


Activity 25(b) gives the matrix form of the 
equations as 


2 O\ /z\_/6 
3 6) Nay A” 
The determinant of the coefficient matrix is 


2x6-0x3=12, 


so the inverse of the coefficient matrix is 


LP SO) 
12\-3 2/7 


Hence 


e\ 1/6 0\/6\_ (3 
y) 2\-3 2] u5) uz’ 
The solution is z = 3, y = 1. 


Activity 25(c) gives the matrix form of the 
equations as 


[ T (5) = ( 1) 

4 3 TE a g 

5 5/ W ? 

The determinant of the coefficient matrix is 


px- Axt 


so the inverse of the coefficient matrix is 


)- 


ow oud 


ow oe 


le 
(9-9) 


The solution is x = 10, y = —15. 


Solution to Activity 27 
(a) The matrix form of the equations is 
G DGG 
-1 4 yY TI 
The determinant of the coefficient matrix is 
5 x 4—(-3) x (-1) = 17. 
This is non-zero, so the coefficient matrix is 


invertible and the equations have a unique 
solution. 


(b) The matrix form of the equations is 
(5 3) @) = Gs) 
-9 3 y 18/` 
The determinant of the coefficient matrix is 
3 x 3—(-1) x (-9) = 0. 


This is zero, so the coefficient matrix is 


non-invertible and the equations do not have a 


unique solution. 


Solution to Activity 28 


(a) The matrix form of the equations is 


2 2 3 5i 2 
23 4 yj = {=l 
4 7 10 zZ 3 
We have 
23 3)" 1 -i 
23 4 = |-2 4 —1 
4 7 10 1 -3 
Therefore 
z 1 4 -$ 2 0 
GLa ae 4 -1 -1 | ={-11 
z 1 -3 1 3 


The solution is z = 0, y = —11, z = 8. 


(b) The matrix form of the equations is 


3 2 1\ /a 1 
43 1\)\ly|= [2 
7 5 IJ œ 1 


Solutions to activities 


We have 
3 2 1\ 7! E S | 
4 3 1 = |-3 4 —l 
To 1 1 1 -l 
Therefore 
An 2 -—3 1 
yJ=|—-38 4 -1 
z 1 1 -l 
The solution is z = —3, y = 4, z = 2. 


(Details of how to use the CAS to find the 
inverses of the matrices are given in the 
Computer algebra guide, in the section 


‘Computer methods for CAS activities in 
Books A-D’.) 


=. Ne 
Il 
a 


Solution to Activity 29 


(a) 


The matrix form of the equations is 


1 2 3 x 5 
2 3 1 y) =16 
4 7 7 z 1 


The determinant of the coefficient matrix is 
zero, so the coefficient matrix is non-invertible 
and the equations do not have a unique 
solution. 


The matrix form of the equations is 


2 2 3 x 5 
2 3 d y) =16 
4 7 7 z 1 


The determinant of the coefficient matrix is 14. 
This is non-zero, so the coefficient matrix is 
invertible and the equations have a unique 
solution. 


The matrix form of the equations is 


1 1-1 #1\ fe 0 
1-1 1 1]fy]_ {2 
2 0 3 2) }o) |1 
6 2 4 OW le 0 


The determinant of the coefficient matrix is 
zero, so the coefficient matrix is non-invertible 
and the equations do not have a unique 
solution. 


(Details of how to use the CAS to find the 
determinants are given in the Computer algebra 
guide, in the section ‘Computer methods for 
CAS activities in Books A—D’.) 
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